
Answer-first summary for fast verification
Answer: Deploy the model to Vertex AI private endpoints in the us-central1 and asia-southeast1 regions, and allow the application to choose the appropriate endpoint.
The correct answer is C. Deploying the model to Vertex AI private endpoints in both the us-central1 and asia-southeast1 regions allows the application to choose the nearest endpoint. This approach will significantly reduce network latency for customers in Singapore since it brings the model closer to them. Additionally, by using Vertex AI private endpoints, you ensure secure and efficient communication between the application and the model. Option D also includes creating endpoints in asia-southeast1 but does not leverage the existing infrastructure in the us-central1 region as efficiently as Option C. The use of GPUs (Option A) or larger machines (Option B) addresses computation power but does not directly mitigate the network latency issue caused by the geographical distance.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
You work for a large bank that offers online services to customers through an application hosted on Google Cloud, with instances running in both the US (us-central1) and Singapore (asia-southeast1) regions. To enhance security, you have developed a PyTorch model designed to classify transactions as potentially fraudulent or not. The model is a three-layer perceptron that takes both numerical and categorical features as input, with hashing operations performed within the model itself. The model has been deployed to the us-central1 region on n1-highcpu-16 machines, providing real-time predictions with a current median response latency of 40 ms. However, customers in Singapore are experiencing higher latencies, and you aim to reduce these delays. What should you do to improve latency, particularly for users in Singapore?
A
Attach an NVIDIA T4 GPU to the machines being used for online inference.
B
Change the machines being used for online inference to n1-highcpu-32.
C
Deploy the model to Vertex AI private endpoints in the us-central1 and asia-southeast1 regions, and allow the application to choose the appropriate endpoint.
D
Create another Vertex AI endpoint in the asia-southeast1 region, and allow the application to choose the appropriate endpoint.