
You need to deploy a scikit-learn classification model to production. The model must serve requests 24/7, and you expect millions of requests per second to the production application between 8 AM and 7 PM. You need to minimize the cost of deployment. What should you do?
A. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 1.
B. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100.
C. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 1.
D. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 100.
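
For reference, the four options differ only in two deployment settings: the maximum replica count and whether a GPU is attached to each replica. The sketch below shows how those settings might map onto the keyword arguments of `Model.deploy` in the Vertex AI Python SDK (`google-cloud-aiplatform`). The machine type, the GPU type, the minimum replica count of 1, and the helper names `deploy_kwargs`, `OPTIONS`, and `deploy` are assumptions made for illustration; none of them come from the question.

```python
from typing import Any, Dict


def deploy_kwargs(max_replicas: int, use_gpu: bool) -> Dict[str, Any]:
    """Build keyword arguments for aiplatform.Model.deploy (illustrative values)."""
    kwargs: Dict[str, Any] = {
        "machine_type": "n1-standard-4",  # assumed machine type, not from the question
        "min_replica_count": 1,           # assumed: one replica stays up for 24/7 serving
        "max_replica_count": max_replicas,  # upper bound for autoscaling
    }
    if use_gpu:
        kwargs["accelerator_type"] = "NVIDIA_TESLA_T4"  # assumed GPU type
        kwargs["accelerator_count"] = 1                 # "one GPU per replica"
    return kwargs


# The four answer options expressed as deployment settings.
OPTIONS = {
    "A": deploy_kwargs(max_replicas=1, use_gpu=False),
    "B": deploy_kwargs(max_replicas=100, use_gpu=False),
    "C": deploy_kwargs(max_replicas=1, use_gpu=True),
    "D": deploy_kwargs(max_replicas=100, use_gpu=True),
}


def deploy(model_resource_name: str, option: str):
    """Deploy an already uploaded model using the settings of one option.

    Requires the google-cloud-aiplatform package and configured credentials.
    """
    # Imported lazily so the rest of the sketch runs without the SDK installed.
    from google.cloud import aiplatform

    model = aiplatform.Model(model_resource_name)
    return model.deploy(**OPTIONS[option])
```

Calling `deploy("<model resource name>", "B")`, for example, would request a CPU-only endpoint that autoscales between 1 and 100 replicas. The project and location are taken from the environment or from a prior `aiplatform.init(...)` call.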