
A Generative AI Engineer has deployed an LLM application at a manufacturing company to assist with customer service inquiries. They need to identify the key enterprise metrics for monitoring the application in production.
Which of the following is NOT a metric they would implement for their customer service LLM application in production?
A. Massive Multitask Language Understanding (MMLU) score
B. Number of customer inquiries processed per unit of time
C. Factual accuracy of the response
D. Time taken for the LLM to generate a response