Ultimate access to all questions.
You are tasked with setting up a data ingestion pipeline from an Amazon Kinesis stream to an Amazon Redshift cluster. The data is highly sensitive and requires specific configurations to ensure data integrity and security. Describe the steps you would take to implement this pipeline, including the use of AWS Lambda for data transformation and Amazon S3 for intermediate storage.
Explanation:
Option B is correct because it ensures data transformation and intermediate storage in S3, which is a best practice for handling large volumes of data and ensuring data integrity. Directly loading data from Kinesis to Redshift without transformation or intermediate storage (Option A) is not recommended for sensitive data. Using Kinesis Firehose without transformation (Option C) does not address the requirement for data transformation. AWS Glue is not suitable for real-time data processing (Option D).