
Answer-first summary for fast verification
Answer: Investigate the issue, and if it continues, assign an incident commander.
Option A is not the first step because the actual problem isn't yet identified. Option B is too narrow, focusing solely on the technical fix without considering the broader incident management process. Option C, while important, is a subsequent step after ensuring the application's stability. Option D is correct as it aligns with Google's recommended approach: first investigate the issue, and if it persists, appoint an incident commander to oversee the resolution process. Reference: [Google SRE Book - Managing Incidents](https://sre.google/sre-book/managing-incidents/)
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
While on-call for managing a production application, you receive alerts indicating the application is failing uptime checks. According to SRE best practices for incident management, what should be your initial action?
A
Inform your team lead immediately.
B
Dive into fixing the issue right away.
C
Conduct a root cause analysis to understand the problem.
D
Investigate the issue, and if it continues, assign an incident commander.
No comments yet.