
A Generative AI Engineer built an LLM application using the provisioned throughput Foundation Model API. The application is ready for deployment, but the request volume is too low to justify a dedicated provisioned throughput endpoint. Which strategy should they use to ensure the best cost-effectiveness?
A. Switch to using External Models instead
B. Deploy the model using pay-per-token throughput as it comes with cost guarantees
C. Change to a model with fewer parameters in order to reduce hardware constraint issues
D. Throttle the incoming batch of requests manually to avoid rate limiting issues
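For low request volumes, pay-per-token Foundation Model API endpoints avoid paying for idle provisioned capacity, and switching usually only requires pointing the application at a different serving endpoint. Below is a minimal sketch of querying a Databricks pay-per-token endpoint via the MLflow Deployments client; the endpoint name and the response fields are assumptions for illustration, not the author's exact setup.

```python
# Sketch: query a pay-per-token Foundation Model API endpoint on Databricks.
# Assumes the mlflow-deployments client and a hypothetical pay-per-token
# chat endpoint name; the provisioned-throughput version of the app would
# differ mainly in which endpoint name is passed here.
from mlflow.deployments import get_deploy_client

client = get_deploy_client("databricks")

response = client.predict(
    endpoint="databricks-meta-llama-3-3-70b-instruct",  # assumed endpoint name
    inputs={
        "messages": [
            {"role": "user", "content": "Summarize the benefits of pay-per-token pricing."}
        ],
        "max_tokens": 128,
    },
)

# Chat endpoints return an OpenAI-style payload; adjust parsing to the
# actual response schema of the endpoint you query.
print(response["choices"][0]["message"]["content"])
```

Because billing is per token processed rather than per provisioned GPU capacity, this pattern keeps costs proportional to the application's low request volume, which is why option B is the most cost-effective choice here.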