
Answer-first summary for fast verification
Answer: Curate a dataset that can test the retrieval and generation components of the system separately. Use MLflow's built-in evaluation metrics to perform the evaluation on the retrieval and generation components.
Option C is the optimal choice because it provides a systematic, modular approach to evaluating a RAG system: the retrieval and generation components are assessed separately, so the engineer can pinpoint whether a weakness lies in document retrieval accuracy or in answer generation quality, with MLflow's built-in evaluation metrics providing objective measurement. The community discussion supports this approach, with 100% consensus and upvoted comments emphasizing its methodical value for debugging and optimization. Option A (ROUGE score) is limited because it only evaluates generation quality and says nothing about retrieval effectiveness. Option B (LLM-as-a-judge) can be subjective and expensive, and it likewise evaluates only the final answers. Option D (benchmarking multiple LLMs) focuses solely on the generation component and does not address potential retrieval issues.
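To make the modular evaluation concrete, the sketch below computes retrieval metrics (precision@k and recall@k, the same quantities MLflow's built-in `precision_at_k`/`recall_at_k` metrics report when you call `mlflow.evaluate` with `model_type="retriever"`) and a simple generation check separately over a curated eval set. The dataset rows, document IDs, and the substring-match generation check are illustrative assumptions, not part of the question; MLflow's GenAI metrics (e.g. answer correctness) would replace the substring check in practice.

```python
# Sketch: scoring retrieval and generation independently on a curated
# eval set. Plain Python is used here so the arithmetic is visible;
# MLflow's built-in retriever metrics compute the same quantities.

def precision_at_k(retrieved, relevant, k):
    """Fraction of the top-k retrieved doc IDs that are relevant."""
    top_k = retrieved[:k]
    if not top_k:
        return 0.0
    return sum(1 for d in top_k if d in relevant) / len(top_k)

def recall_at_k(retrieved, relevant, k):
    """Fraction of all relevant doc IDs that appear in the top-k."""
    if not relevant:
        return 0.0
    return sum(1 for d in retrieved[:k] if d in relevant) / len(relevant)

# Hypothetical eval row: ground-truth relevant docs for the retriever
# plus a reference answer for the generator.
eval_set = [
    {"question": "How many PTO days do new hires get?",
     "relevant_docs": {"pto_policy", "benefits_overview"},
     "retrieved": ["pto_policy", "holiday_list", "benefits_overview"],
     "reference_answer": "15 days",
     "generated_answer": "New hires receive 15 days of PTO."},
]

for row in eval_set:
    p = precision_at_k(row["retrieved"], row["relevant_docs"], k=3)
    r = recall_at_k(row["retrieved"], row["relevant_docs"], k=3)
    # Crude generation check: does the answer contain the reference?
    ok = row["reference_answer"].lower() in row["generated_answer"].lower()
    print(f"precision@3={p:.2f} recall@3={r:.2f} answer_ok={ok}")
```

Because the two scores are computed independently, a low recall@k with a passing generation check points at the retriever, while good retrieval scores with wrong answers point at the generator or the prompt.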
Author: LeetQuiz Editorial Team
A Generative AI Engineer has developed a RAG application to help employees interpret HR documentation. The prototype is functional and has received positive initial feedback from internal testers. How should the engineer now formally evaluate the system's performance and identify areas for improvement?
A. Use ROUGE score to comprehensively evaluate the quality of the final generated answers.
B. Use an LLM-as-a-judge to evaluate the quality of the final answers generated.
C. Curate a dataset that can test the retrieval and generation components of the system separately. Use MLflow's built-in evaluation metrics to perform the evaluation on the retrieval and generation components.
D. Benchmark multiple LLMs with the same data and pick the best LLM for the job.