
Explanation:
Based on the scenario described, the company needs to perform inference on large archived datasets (multiple GBs in size) and does not require immediate access to predictions. Let's evaluate each SageMaker inference option against these requirements:
Batch Transform is specifically designed for processing large volumes of data stored in Amazon S3. It processes data in batches without requiring a persistent endpoint, making it ideal for:
This aligns perfectly with the company's requirements for analyzing archived data without immediate prediction needs.
Real-time Inference requires a persistent endpoint that serves predictions with low latency (typically milliseconds). This is unsuitable because:
Serverless Inference provides automatic scaling with no infrastructure management, but:
Asynchronous Inference queues requests and processes them asynchronously, which:
Batch Transform (Option A) is the optimal choice because it's specifically engineered for offline batch processing of large datasets stored in S3. It provides the most cost-effective, scalable solution when immediate predictions aren't required, making it ideal for analyzing archived data in bulk. The other options are designed for scenarios requiring faster response times or different workload patterns.
Ultimate access to all questions.
No comments yet.
Which Amazon SageMaker inference option is appropriate for performing inference on large archived datasets of multiple GBs in size when real-time access to predictions is not required?
A
Batch transform
B
Real-time inference
C
Serverless inference
D
Asynchronous inference