In a stream processing solution, you need to process data from multiple sources with varying data quality. How would you approach this task to ensure efficient and accurate processing?
A. Ignore the data quality issues and process the data as is.
B. Process each data source independently using a specific processing approach for each data quality issue.
C. Use a distributed processing framework like Spark to process the data from multiple sources with varying data quality.
D. Use a distributed processing framework like Spark to process the data from multiple sources with varying data quality and implement data cleaning and preprocessing techniques.
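
To make the phrase "data cleaning and preprocessing techniques" concrete, here is a minimal sketch in plain Python of what such a step might look like for records arriving from several sources. It is not Spark code. The record fields (`source`, `user_id`, `amount`), the function names `clean_record` and `clean_stream`, and the specific rules (drop records with no key, drop non-numeric or negative amounts, drop duplicates) are all assumptions chosen for illustration. A framework such as Spark would apply comparable per-record logic across a cluster.

```python
from typing import Iterable, Iterator, Optional


def clean_record(raw: dict) -> Optional[dict]:
    """Normalize one raw record, or return None if it cannot be repaired.

    The field names and rules here are hypothetical, for illustration only.
    """
    # Normalize the key: coerce to string, trim whitespace, lowercase.
    user_id = str(raw.get("user_id") or "").strip().lower()
    if not user_id:
        return None  # no usable key

    # Coerce the amount to a float; reject missing or non-numeric values.
    try:
        amount = float(raw.get("amount"))
    except (TypeError, ValueError):
        return None
    if amount < 0:
        return None  # assumed business rule: negative amounts are invalid

    return {
        "source": raw.get("source", "unknown"),
        "user_id": user_id,
        "amount": amount,
    }


def clean_stream(records: Iterable[dict]) -> Iterator[dict]:
    """Yield cleaned, de-duplicated records from a mixed-quality stream."""
    # Simplification: an unbounded set. A real streaming job would bound
    # this state, for example by de-duplicating within a time window.
    seen = set()
    for raw in records:
        rec = clean_record(raw)
        if rec is None:
            continue
        key = (rec["source"], rec["user_id"], rec["amount"])
        if key in seen:
            continue
        seen.add(key)
        yield rec
```

Because `clean_stream` is a generator, it processes records one at a time as they arrive instead of loading the whole input first, which is the property a streaming pipeline relies on.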