
Answer-first summary for fast verification
Answer: Ensure that test cases that catch errors of this type are run successfully before new software releases.
In the context of Site Reliability Engineering (SRE) principles, the focus is on blameless postmortems and improving systems to prevent future incidents rather than assigning blame. Option A suggests escalating to senior management, which contradicts the blameless culture. Option C prescribes practices to employees, which may not address systemic issues. Option D focuses on incident response rather than prevention. Option B, ensuring test cases catch errors before releases, directly addresses preventing similar incidents by improving the system's reliability and is aligned with SRE principles.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
Your organization adheres to Site Reliability Engineering (SRE) practices. While drafting a postmortem for a user-impacting incident caused by a software change, you aim to mitigate future severe incidents. What steps should you take?
A
Identify engineers responsible for the incident and escalate to their senior management.
B
Ensure that test cases that catch errors of this type are run successfully before new software releases.
C
Follow up with the employees who reviewed the changes and prescribe practices they should follow in the future.
D
Design a policy that will require on-call teams to immediately call engineers and management to discuss a plan of action if an incident occurs.
No comments yet.