You are a machine learning engineer at a company that has developed an ML model to detect the sentiment of users' posts on your company's social media page. The goal is to identify potential outages or bugs through sentiment analysis. For real-time predictions, you are using Dataflow to process data ingested from Pub/Sub. You plan to update the model regularly with new training iterations and keep the latest two versions live after every update. You need to split the incoming traffic between the two versions in an 80:20 ratio, with the newest model getting 80% of the traffic. Additionally, you want the solution to require minimal management and keep the pipeline as simple as possible. What should you do?

Exam-Like

Deploy the models to a Vertex AI endpoint using the traffic-split=0=80, PREVIOUS_MODEL_ID=20 configuration.

80.6%

Wrap the models inside an App Engine application using the --splits PREVIOUS_VERSION=0.2, NEW_VERSION=0.8 configuration

4.8%

Wrap the models inside a Cloud Run container using the REVISION1=20, REVISION2=80 revision configuration.

6.5%

Implement random splitting in Dataflow using beam.Partition() with a partition function calling a Vertex AI endpoint.

8.1%

Google Professional Machine Learning Engineer