Google Professional Cloud DevOps Engineer

You manage a widely used mobile game application running on Google Kubernetes Engine (GKE) across multiple Google Cloud regions, with each region containing several Kubernetes clusters. A report indicates that users in a specific region cannot connect to the application. Following Site Reliability Engineering (SRE) principles, what is the first action you should take to resolve this incident?

Reroute the user traffic from the affected region to other regions that don't report issues.

66.7%

Use Stackdriver Monitoring to check for a spike in CPU or memory usage for the affected region.

0.0%