Incident Response Playbooks for Multi-CDN
Synopsis

This chapter provides standard playbooks for incidents in multi-CDN environments. It covers detection, triage, scoping by geography and network, isolation and reroute choices, change control during active events, communication, restoration, and post-incident analysis. The objective is to protect users first, keep changes reversible, and leave an audit trail that improves future responses.

Principles

Incident handling favors user outcomes over internal metrics. Actions modify the smallest scope that achieves protection. All changes must be reversible. Each action records who acted, what changed, and why. Telemetry drives decisions and distinguishes symptoms from causes. Providers are treated as interchangeable routes unless a risk register documents exceptions. ...
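
The principles of smallest scope, reversibility, and an audit trail can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the names `AuditEntry`, `ReversibleChange`, `IncidentLog`, and `shift_weights`, the per-region weight dictionary, and the provider labels `cdn_a` and `cdn_b` are all hypothetical and do not correspond to any specific CDN or traffic-steering API.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable, Dict, List


@dataclass
class AuditEntry:
    """One audit record: who acted, what changed, and why."""
    actor: str
    change: str
    reason: str
    timestamp: str


@dataclass
class ReversibleChange:
    """A change paired with the action that undoes it."""
    description: str
    apply: Callable[[], None]
    revert: Callable[[], None]


@dataclass
class IncidentLog:
    """Applies changes, records each one, and can undo the most recent."""
    entries: List[AuditEntry] = field(default_factory=list)
    _applied: List[ReversibleChange] = field(default_factory=list)

    def _record(self, actor: str, change: str, reason: str) -> None:
        now = datetime.now(timezone.utc).isoformat()
        self.entries.append(AuditEntry(actor, change, reason, now))

    def execute(self, actor: str, change: ReversibleChange, reason: str) -> None:
        change.apply()
        self._applied.append(change)
        self._record(actor, change.description, reason)

    def rollback_last(self, actor: str, reason: str) -> None:
        change = self._applied.pop()
        change.revert()
        self._record(actor, "REVERT: " + change.description, reason)


def shift_weights(
    weights: Dict[str, Dict[str, int]],
    region: str,
    new: Dict[str, int],
) -> ReversibleChange:
    """Build a change that touches one region only (smallest scope)."""
    previous = dict(weights[region])  # snapshot taken so the change can be undone

    def apply() -> None:
        weights[region] = dict(new)

    def revert() -> None:
        weights[region] = dict(previous)

    return ReversibleChange(f"{region} weights {previous} -> {new}", apply, revert)


# Hypothetical traffic split per region, in percent per provider.
weights = {
    "eu-west": {"cdn_a": 50, "cdn_b": 50},
    "us-east": {"cdn_a": 50, "cdn_b": 50},
}
log = IncidentLog()
change = shift_weights(weights, "eu-west", {"cdn_a": 0, "cdn_b": 100})
log.execute("alice", change, "cdn_a error spike observed in eu-west")
log.rollback_last("alice", "cdn_a recovered")
```

In this sketch the reroute is limited to the affected region, the prior state is captured before the change is applied, and both the change and its reversal produce an audit entry. A real system would persist the log and apply changes through whatever steering mechanism is actually in use.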