
Ultimate access to all questions.
You are responsible for deploying a new scikit-learn classification model into a production environment for a company application. The system must be reliable and capable of handling high traffic loads, processing millions of requests per second from 8 am to 7 pm every day. Given these requirements, you must also focus on minimizing overall deployment costs while ensuring the system's performance and uptime. What should you do?
A
Deploy an online Vertex AI prediction endpoint. Set the max replica count to 1
B
Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100
C
Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 1
D
Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 100