AWS Certified AI Practitioner

Get started today

Ultimate access to all questions.

Explanation:

Detailed Explanation

This question asks about selecting an appropriate machine learning model for filling in missing words in text documents, which is a specific natural language processing (NLP) task.

Analysis of Options:

A: Topic modeling - Topic modeling algorithms like LDA (Latent Dirichlet Allocation) are designed to discover abstract topics within a collection of documents. They identify patterns of word co-occurrence to group documents by thematic content. While useful for document categorization and content analysis, topic modeling does not predict or fill in specific missing words within sentences. It operates at the document level rather than the word-level prediction required here.

B: Clustering models - Clustering algorithms (like K-means, hierarchical clustering) group similar data points together based on features. In text applications, they might group similar documents or sentences. However, clustering is fundamentally an unsupervised technique for grouping, not for predicting missing words or generating text completions. It lacks the predictive capability needed for this word-filling task.

C: Prescriptive ML models - Prescriptive models go beyond prediction to recommend actions or decisions. They're typically used in optimization, recommendation systems, or decision support contexts (e.g., suggesting optimal inventory levels or treatment plans). While powerful in their domain, prescriptive models are not specifically designed for natural language word prediction tasks like filling missing text.

D: BERT-based models - This is the correct choice. BERT (Bidirectional Encoder Representations from Transformers) is a transformer-based language model specifically designed for understanding context in text. Its architecture includes:

Masked Language Modeling (MLM): During pre-training, BERT learns by predicting randomly masked words in sentences, which directly aligns with the requirement of suggesting words for missing text.
Bidirectional context understanding: Unlike traditional language models that process text sequentially, BERT considers both left and right context simultaneously, enabling more accurate word predictions based on surrounding text.
Contextual embeddings: BERT generates word representations that vary based on context, allowing it to suggest appropriate words that fit the specific semantic and syntactic structure of the incomplete text.

BERT-based models have been successfully fine-tuned for various NLP tasks including text completion, question answering, and masked word prediction—making them ideally suited for this specific requirement of suggesting potential words to fill missing text in documents.

Why BERT is Optimal:

Direct alignment with task: The masked language modeling objective in BERT's training directly corresponds to the problem of filling missing words.
Contextual understanding: The bidirectional attention mechanism allows the model to consider the entire sentence context when suggesting missing words.
Proven effectiveness: BERT and its variants have demonstrated state-of-the-art performance on various language understanding tasks, including those requiring word prediction in context.
Practical applicability: Pre-trained BERT models can be fine-tuned with the company's specific documents to improve accuracy for their domain-specific vocabulary and writing style.

The other options represent valuable ML approaches but are not designed for the specific word-prediction task described in the requirement.

Explanation:

Detailed Explanation

This question asks about selecting an appropriate machine learning model for filling in missing words in text documents, which is a specific natural language processing (NLP) task.

Analysis of Options:

Masked Language Modeling (MLM): During pre-training, BERT learns by predicting randomly masked words in sentences, which directly aligns with the requirement of suggesting words for missing text.
Bidirectional context understanding: Unlike traditional language models that process text sequentially, BERT considers both left and right context simultaneously, enabling more accurate word predictions based on surrounding text.
Contextual embeddings: BERT generates word representations that vary based on context, allowing it to suggest appropriate words that fit the specific semantic and syntactic structure of the incomplete text.

Why BERT is Optimal:

Direct alignment with task: The masked language modeling objective in BERT's training directly corresponds to the problem of filling missing words.
Contextual understanding: The bidirectional attention mechanism allows the model to consider the entire sentence context when suggesting missing words.
Proven effectiveness: BERT and its variants have demonstrated state-of-the-art performance on various language understanding tasks, including those requiring word prediction in context.
Practical applicability: Pre-trained BERT models can be fine-tuned with the company's specific documents to improve accuracy for their domain-specific vocabulary and writing style.

The other options represent valuable ML approaches but are not designed for the specific word-prediction task described in the requirement.

Comments (0)

No comments yet.

A company has documents with missing words due to a database error. They want to build a machine learning model that can propose possible words to fill in the gaps in the text. What type of model is suitable for this requirement?

Exam-Like

Last updated: February 8, 2026 at 20:17

Topic modeling

10.0%

Clustering models

10.0%

Prescriptive ML models

10.0%

BERT-based models