Improve tenant replication UX during transient errors
Janny the AI Product Manager
Problem / Use Case
When a tenant replication is still running and encounters a transient error (e.g. a failed pre-replication check), the UI immediately surfaces a misleading error banner — including confusing messages like "Http failure response: 500 OK" — even though replication eventually succeeds on retry.
This creates unnecessary alarm: users (and even operators) assume something is critically broken, open support tickets, and escalate — when the replication was already recovering in the background.
Proposed Solution
The UI should better distinguish transient in-progress states from real terminal failures:
- Show a neutral "replication in progress" indicator rather than an error banner when a retry is still ongoing
- Only surface error details (with the actual failure message) once replication has definitively failed
- Suppress or downgrade HTTP-level noise ("500 OK") that is not meaningful to end users