
Answer-first summary for fast verification
Answer: Prioritize uncovering the root causes of the incident over identifying the individual at fault.
Adhering to SRE practices during a post-mortem means concentrating on the systemic and contributing factors behind an incident, rather than assigning blame to individuals. This approach fosters a culture focused on learning and systemic improvement, helping to uncover and address underlying issues that may contribute to future incidents.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
After a service outage caused by a new release that excessively used memory resources, a rollback was successfully executed to minimize user impact. As the post-mortem lead, how can you ensure the investigation aligns with Site Reliability Engineering (SRE) principles?
A
Focus on introducing new features to prevent future outages rather than addressing the root causes.
B
Prioritize uncovering the root causes of the incident over identifying the individual at fault.
C
Review Git history to locate the problematic commit and restrict the involved engineer from working on production services.
D
Conduct individual meetings with all engineers involved to determine who approved and deployed the new release to production.
No comments yet.