
Answer-first summary for fast verification
Answer: Redirect traffic away from the affected region to the other two regions while draining existing connections.
The correct approach is to drain requests from the affected region and redirect them to the other two regions. This strategy allows the system to continue operating optimally under the circumstances, providing time to correct the errors in the affected region and apply a new patch. Other options, such as restarting instances or increasing their number, do not address the immediate issue and may prolong the Mean Time to Repair (MTTR). Increasing instances in the problematic region would only propagate the error, as new instances would inherit the same flawed patch. For more details, refer to resources on Effective Troubleshooting and Enabling Connection Draining.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
Imagine your Site Reliability Engineering (SRE) team is overseeing an application deployed across three regions, utilizing Managed Instance Groups behind a global HTTP(S) Load Balancer. During the application of a critical security patch to Compute Engines, the first two regions are successfully updated, but an error in the third region causes request failures. To minimize user impact from this unsuccessful patch, what is the best course of action?
A
Rollback the changes made to the affected region.
B
Increase the instance count in the problematic region.
C
Initiate a restart of all instances in the affected region.
D
Redirect traffic away from the affected region to the other two regions while draining existing connections.
No comments yet.