SEV-2: Elevated errors after ASTRYX asset assignment release
Status: published
Post-deploy verification detected a sharp increase in 500 responses on the asset assignment path shortly after production rollout.
Impact
Elevated error rate on assignment reads for ~10 minutes; rollback completed before any customer data was at risk of corruption.
Root cause
Incorrect join assumption under partial tenant backfill: the code path assumed migrated rows existed for all tenants, which was not yet true for tenants still awaiting backfill.
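The failure mode can be illustrated with a minimal sketch. The table names, columns, and function names below are hypothetical and are not taken from the ASTRYX codebase; the sketch only assumes that an inner join against the migrated table returns no row for a tenant that has not been backfilled, and that the caller dereferences the result without checking.

```python
import sqlite3

# Hypothetical schema: a legacy table populated for every tenant and a
# migrated table populated only for tenants whose backfill has finished.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE assignments (tenant_id TEXT, asset_id TEXT);
CREATE TABLE assignments_v2 (tenant_id TEXT, asset_id TEXT, owner TEXT);
INSERT INTO assignments VALUES ('tenant-a', 'laptop-1'), ('tenant-b', 'phone-7');
INSERT INTO assignments_v2 VALUES ('tenant-a', 'laptop-1', 'alice');
""")

JOIN_ON = "ON a.tenant_id = v2.tenant_id AND a.asset_id = v2.asset_id"


def read_owner_unsafe(tenant_id):
    # Inner join: yields no row for a tenant without migrated data.
    row = conn.execute(
        f"SELECT v2.owner FROM assignments a JOIN assignments_v2 v2 {JOIN_ON} "
        "WHERE a.tenant_id = ?",
        (tenant_id,),
    ).fetchone()
    # Raises TypeError when row is None; an unhandled exception like this
    # would typically surface as a 500 from a request handler.
    return row[0]


def read_owner_safe(tenant_id):
    # Left join: the legacy row is still returned, with owner NULL when the
    # tenant has not been backfilled yet.
    row = conn.execute(
        f"SELECT v2.owner FROM assignments a LEFT JOIN assignments_v2 v2 {JOIN_ON} "
        "WHERE a.tenant_id = ?",
        (tenant_id,),
    ).fetchone()
    return row[0] if row is not None else None
```

In this sketch, `read_owner_unsafe("tenant-b")` raises while `read_owner_safe("tenant-b")` returns `None`, leaving the caller to decide how to handle a tenant that is not yet migrated.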
Timeline
- T+0: Deploy completed; canary healthy
- T+4m: Error rate breach vs rolling baseline
- T+6m: Rollback recommended in ReleasePilot
- T+12m: Rollback complete; metrics green
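The T+4m detection compares the current error rate against a rolling baseline. The report does not describe how ReleasePilot computes this, so the following is only a sketch of one common approach; the class name, window size, multiplier, and floor are all assumptions.

```python
from collections import deque


class RollingBaseline:
    """Flags an error-rate sample that exceeds a multiple of the recent mean."""

    def __init__(self, window=30, multiplier=3.0, floor=0.01):
        self.samples = deque(maxlen=window)  # most recent healthy samples
        self.multiplier = multiplier         # assumed breach factor
        self.floor = floor                   # minimum threshold, avoids noise near zero

    def breached(self, error_rate):
        # During warm-up there is no baseline yet, so never report a breach.
        if len(self.samples) < self.samples.maxlen:
            self.samples.append(error_rate)
            return False
        baseline = sum(self.samples) / len(self.samples)
        threshold = max(baseline * self.multiplier, self.floor)
        is_breach = error_rate > threshold
        # Keep breaching samples out of the baseline so an ongoing incident
        # does not raise the threshold.
        if not is_breach:
            self.samples.append(error_rate)
        return is_breach
```

Excluding breaching samples from the window is a design choice in this sketch: it keeps the baseline anchored to pre-incident behavior for the duration of the incident.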
Action items
- Add tenant backfill completeness gate to preflight · Platform · due 2026-05-15 · open
- Extend integration tests with partial migration fixtures · ASTRYX team · due 2026-05-10 · open
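As a rough illustration of the first action item, a backfill completeness gate could compare the full tenant list with the set of tenants that have finished backfill and block the release if any are missing. The function name and inputs are hypothetical; how preflight actually obtains these tenant lists is not specified in this report.

```python
def backfill_gate(all_tenants, backfilled_tenants):
    """Return (passed, missing) for a hypothetical preflight check.

    passed is True only when every tenant appears in backfilled_tenants;
    missing lists the tenants that would block the release, sorted for
    stable output.
    """
    missing = sorted(set(all_tenants) - set(backfilled_tenants))
    return (not missing, missing)


# Example: one tenant has not been backfilled, so the gate fails.
passed, missing = backfill_gate(
    ["tenant-a", "tenant-b", "tenant-c"],
    ["tenant-a", "tenant-c"],
)
```

The same partially backfilled tenant list could serve as a partial migration fixture for the integration tests in the second action item.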