Google Professional Machine Learning Engineer

Your organization uses a scikit-learn fraud detection model deployed on a Vertex AI endpoint, which is currently configured with one e2-standard-2 machine (2 vCPUs, 8 GB memory). You observe that incoming traffic can spike to four times the endpoint's current capacity. What is the most cost-effective way to handle this?
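The sizing implied by the scenario can be sketched with a little arithmetic. This is a minimal illustration, not an answer key: the function name `replicas_needed` is hypothetical, and it assumes throughput grows linearly with the number of identical replicas, which the question does not state.

```python
import math


def replicas_needed(peak_multiplier: float, baseline_replicas: int = 1) -> int:
    """Estimate replicas required to absorb a traffic peak.

    Assumption (not from the question): serving capacity scales linearly
    with the number of identical replicas.
    """
    if peak_multiplier <= 0:
        raise ValueError("peak_multiplier must be positive")
    if baseline_replicas < 1:
        raise ValueError("baseline_replicas must be at least 1")
    # Round up, because a fractional replica cannot be provisioned.
    return math.ceil(peak_multiplier * baseline_replicas)


# Scenario from the question: one machine today, spikes of 4x current capacity.
peak_replicas = replicas_needed(peak_multiplier=4.0, baseline_replicas=1)
print(f"Replicas needed at peak: {peak_replicas}")
```

Under that linear-scaling assumption, the peak needs roughly four machines of the current size, while normal traffic needs only one. Whether that extra capacity is kept running all the time or added only during spikes is the cost trade-off the question is probing.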