
Answer-first summary for fast verification
Answer: Tokenization
**Explanation:** Tokenization is the process of breaking text down into smaller units called tokens that a language model can process. In the context of foundation models such as those used in Amazon Bedrock:

1. **Tokenization** converts raw text into a sequence of tokens (words, subwords, or characters) that the model processes.
2. **Stemming** reduces words to their root form (e.g., "running" → "run"), which is a different preprocessing technique.
3. **Vectorization** converts text into numerical vectors, typically for traditional ML models.
4. **Stopword removal** eliminates common words like "the", "and", and "is" that may not add significant meaning.

For foundation models in Amazon Bedrock, tokenization is the essential first step, in which input text is converted into tokens that the model's architecture can process.
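As an illustration, here is a minimal sketch of word-level tokenization using only Python's standard library. This is a simplified scheme for demonstration; foundation models like those in Amazon Bedrock typically use subword tokenizers (e.g., byte-pair encoding), and the `tokenize` function below is a hypothetical helper, not a Bedrock API.

```python
import re

def tokenize(text):
    # Split text into word and punctuation tokens (a simplified
    # word-level scheme; production models use subword tokenizers).
    return re.findall(r"\w+|[^\w\s]", text)

tokens = tokenize("Chatbots understand tokens, not raw text!")
print(tokens)
# → ['Chatbots', 'understand', 'tokens', ',', 'not', 'raw', 'text', '!']
```

The key idea is the same regardless of scheme: the model never sees raw strings, only the token sequence produced by this first preprocessing step.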
Author: Jin H
A company is developing a chatbot using Amazon Bedrock. Before sending user input to a foundation model, the text must be broken down into smaller pieces that the model understands. What is this process called?
A. Stemming
B. Tokenization
C. Vectorization
D. Stopword removal