AWS Certified AI Practitioner

Get started today

Ultimate access to all questions.

Explanation:

The correct answer is D) Semi-supervised learning.

Explanation:

Semi-supervised learning is the appropriate approach for this scenario because:

The company has "massive amounts of unlabeled posts" - this is the key characteristic
They want to learn contextual word relationships from this unlabeled data
They plan to fine-tune the model later for sentiment analysis (which would require some labeled data)

Why other options are incorrect:

A) Transfer learning: This involves using a pre-trained model on one task and adapting it to another related task. While related, it doesn't specifically address the use of massive unlabeled data.
B) Reinforcement learning: This involves learning through trial-and-error interactions with an environment to maximize rewards, which is not applicable to learning from static text data.
C) Self-supervised learning: This is actually a closer alternative, but self-supervised learning typically creates its own labels from the data (like predicting masked words). However, the scenario specifically mentions "unlabeled posts" and the need to learn contextual relationships before fine-tuning, which aligns more with semi-supervised learning where a small amount of labeled data is combined with large amounts of unlabeled data.

Key distinction: Semi-supervised learning leverages both labeled and unlabeled data, which matches the company's plan to use unlabeled posts first, then fine-tune (with labeled data) for sentiment analysis.

Explanation:

The correct answer is D) Semi-supervised learning.

Explanation:

Semi-supervised learning is the appropriate approach for this scenario because:

The company has "massive amounts of unlabeled posts" - this is the key characteristic
They want to learn contextual word relationships from this unlabeled data
They plan to fine-tune the model later for sentiment analysis (which would require some labeled data)

Why other options are incorrect:

A) Transfer learning: This involves using a pre-trained model on one task and adapting it to another related task. While related, it doesn't specifically address the use of massive unlabeled data.
B) Reinforcement learning: This involves learning through trial-and-error interactions with an environment to maximize rewards, which is not applicable to learning from static text data.
C) Self-supervised learning: This is actually a closer alternative, but self-supervised learning typically creates its own labels from the data (like predicting masked words). However, the scenario specifically mentions "unlabeled posts" and the need to learn contextual relationships before fine-tuning, which aligns more with semi-supervised learning where a small amount of labeled data is combined with large amounts of unlabeled data.

Comments (0)

No comments yet.

A social media company wants to train a large language model using massive amounts of unlabeled posts to learn contextual word relationships before fine-tuning for sentiment analysis. Which approach should they use?

Real Exam

Community

JJin

Last updated: February 18, 2026 at 14:05

Transfer learning

15.8%

Reinforcement learning

10.5%

Self-supervised learning

31.6%

Semi-supervised learning

42.1%