
Explanation:
The scenario describes jailbreaking, which is a security testing technique specifically focused on bypassing the safety guardrails and restrictions implemented in foundation models (FMs) and large language models (LLMs).
Jailbreaking differs from other security techniques because it specifically targets the content filtering and safety mechanisms unique to foundation models. It involves crafting prompts or inputs that manipulate the model into violating its intended safety protocols, which is exactly what the scenario describes.
In AWS AI Practitioner contexts, understanding jailbreaking is important for implementing proper security measures, monitoring, and mitigation strategies for foundation models deployed in production environments.
Ultimate access to all questions.
A company is evaluating the security of a foundation model (FM). In the test, they attempt to bypass the model's safety guardrails to generate harmful content.
Which security technique does this scenario describe?
A
Fuzzing training data to find vulnerabilities
B
Denial of service (DoS)
C
Penetration testing with authorization
D
Jailbreak
No comments yet.