
Google Professional Machine Learning Engineer
Your organization uses a scikit-learn fraud detection model deployed on a Vertex AI endpoint, which is currently configured with one e2-standard-2 machine (2 vCPUs, 8 GB memory). You observe that incoming traffic can spike to four times the endpoint's current capacity. What is the most cost-effective way to handle this?