EIVUS

Planning Maintenance Windows

Schedule downtime, communicate with users, and have a rollback plan.

Back to blog

Choose a low-traffic window and announce in advance. List steps and dependencies; have a rollback procedure. For HA systems, do rolling updates if possible. After maintenance, verify and monitor before closing the window.

When and how to announce

  • Window: Pick a time with historically low traffic (e.g. night or weekend). Consider time zones if users are global. Agree with stakeholders and provider (e.g. network maintenance).
  • Announce: Notify users days or weeks ahead via email, status page, or in-app banner. State start and end (with buffer), expected impact (full outage vs degraded), and where to get updates.
  • Reminder: Send a short reminder 24h before. If you must reschedule, communicate clearly and set a new date.

Execution

  • Checklist: Write step-by-step instructions: pre-checks, commands or changes, post-checks. Include dependencies (e.g. "after DB upgrade, restart app"). Reduces mistakes and speeds rollback.
  • Rollback: Define what "rollback" means (revert config, restore backup, fail back to old version) and the exact steps. Test the rollback path in staging if possible. If in doubt, roll back and reschedule.
  • HA and rolling: If you have multiple app or DB nodes, consider rolling updates: update one node at a time while others serve traffic. Reduces or eliminates downtime but requires compatible versions and health checks.

After the window

  • Verify: Run smoke tests or health checks. Confirm services are up and data is consistent. Check logs for errors.
  • Monitor: Watch metrics and alerts for the next hours. Sometimes issues appear only under load or after caches expire.
  • Close: Update status page and notify users that maintenance is complete. Document what was done and any follow-up items (e.g. cleanup next week).

Summary

Choose a low-traffic window and announce in advance. List steps and dependencies; have a rollback procedure. Use rolling updates for HA when possible. After maintenance, verify, monitor, and close the window formally.

Clients who trust us