As a GCP engineer for a legal cricket betting app, you manage an autoscaled managed instance group (MIG) that adds VMs when CPU utilization exceeds 85%, up to a maximum of 6 VMs. The app takes about 3 minutes to initialize, but the HTTP health check begins probing after only 30 seconds, so the group ends up with more instances than the traffic requires. How can you prevent this over-provisioning?
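The timing mismatch in the question can be worked through with a little arithmetic. The following is a minimal sketch, not a GCP API call: the function names and the 20-second safety margin are hypothetical, and only the 3-minute, 30-second, and 6-VM figures come from the question itself.

```python
# Values taken from the question; the names are illustrative, not GCP fields.
APP_INIT_SECONDS = 3 * 60          # app needs about 3 minutes to initialize
HEALTH_CHECK_DELAY_SECONDS = 30    # health check currently starts after 30 s
MAX_INSTANCES = 6                  # autoscaler cap on the group size


def premature_window(app_init: int, check_delay: int) -> int:
    """Seconds during which a new VM is probed although it cannot serve yet."""
    return max(0, app_init - check_delay)


def min_safe_delay(app_init: int, margin: int = 20) -> int:
    """Smallest delay that covers initialization plus an assumed margin."""
    return app_init + margin


if __name__ == "__main__":
    gap = premature_window(APP_INIT_SECONDS, HEALTH_CHECK_DELAY_SECONDS)
    print(f"VM is checked {gap} s before it can serve traffic")
    print(f"Delay that covers startup: {min_safe_delay(APP_INIT_SECONDS)} s")
```

With the question's numbers, each new VM is judged for 150 seconds before it can take load, which is consistent with the group growing toward its 6-VM cap while CPU on the existing VMs stays high. A delay of roughly 200 seconds, applied to the group's health-check initial delay (and, if it is also shorter than the startup time, the autoscaler's initialization period), would likely close that window; the exact margin is an assumption to tune.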