
Answer-first summary for fast verification
Answer: Store the data in an Amazon S3 bucket. Process and transform the data by using S3 Object Lambda before returning the data to the requesting application.
## Explanation **Correct Answer: B** **Why Option B is correct:** 1. **S3 Object Lambda** allows you to add custom code to process data as it's being retrieved from S3, without needing to create and maintain separate copies of the data. 2. **Least operational overhead**: With S3 Object Lambda, you store the data once in a single S3 bucket, and the Lambda function can dynamically remove PII for the two applications that don't need it, while preserving PII for the one application that does. 3. **No data duplication**: You avoid storing three separate copies of the data, which reduces storage costs and complexity. 4. **Centralized processing**: The PII removal logic is centralized in the Lambda function, making it easier to maintain and update. **Why other options are incorrect:** - **Option A**: Creating a proxy application layer adds significant operational overhead for development, deployment, and maintenance of the proxy service. - **Option C**: Storing three separate copies of the data in S3 buckets increases storage costs, creates data synchronization challenges, and requires upfront processing to create all three datasets. - **Option D**: Similar to Option C, but with DynamoDB which is less suitable for terabyte-scale data storage and would be more expensive for this use case. Creating three separate tables also adds complexity and cost. **Key AWS Services:** - **Amazon S3**: Ideal for storing terabytes of data cost-effectively - **S3 Object Lambda**: Allows on-the-fly data transformation without modifying the original data - **AWS Lambda**: Serverless compute for processing data **Security Consideration:** This solution ensures PII is only exposed to authorized applications while minimizing data duplication and operational complexity.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
An ecommerce company stores terabytes of customer data in the AWS Cloud. The data contains personally identifiable information (PII). The company wants to use the data in three applications. Only one of the applications needs to process the PII. The PII must be removed before the other two applications process the data.
Which solution will meet these requirements with the LEAST operational overhead?
A
Store the data in an Amazon DynamoDB table. Create a proxy application layer to intercept and process the data that each application requests.
B
Store the data in an Amazon S3 bucket. Process and transform the data by using S3 Object Lambda before returning the data to the requesting application.
C
Process the data and store the transformed data in three separate Amazon S3 buckets so that each application has its own custom dataset. Point each application to its respective S3 bucket.
D
Process the data and store the transformed data in three separate Amazon DynamoDB tables so that each application has its own custom dataset. Point each application to its respective DynamoDB table.