You have developed a language understanding model for a virtual assistant that can handle various intents such as 'book_flight', 'check_weather', and 'play_music'. You want to test the model's performance using a set of test data. Which of the following approaches should you use to evaluate the model's accuracy?

Simulated

Last updated: February 14, 2026 at 14:02

Manually review the model's predictions for each test sample and compare them to the expected results.

0.0%

Use a simple majority voting system to determine the model's predictions for each test sample.

0.0%

Calculate the percentage of test samples for which the model's predictions match the expected results.

20.0%

Measure the model's performance using a combination of precision, recall, and F1-score metrics.

80.0%

Microsoft Certified Azure AI Engineer Associate - AI-102