
Ultimate access to all questions.
A company is using Amazon Bedrock's automatic model evaluation to assess a generative text summarization model they built. Which metric should they use to measure the model's accuracy?
A
Area Under the ROC Curve (AUC) score
B
F1 score
C
BERTScore
D
Real world knowledge (RWK) score