
Ultimate access to all questions.
You are working on a project that involves processing large volumes of social media data to extract sentiment analysis for a marketing campaign. The data is highly unstructured and requires real-time processing. Describe how you would set up a data processing pipeline to handle this task, including the technologies you would use and how you would ensure the pipeline is scalable and efficient.
A
Use a batch processing approach with SQL databases and ignore real-time requirements.
B
Leverage Apache Kafka for real-time data ingestion, use AWS Lambda for real-time processing, and Amazon Elasticsearch Service for sentiment analysis.
C
Store all data in a single database and process it using scheduled batch jobs.
D
Manually process each data source separately without integrating them.