
Answer-first summary for fast verification
Answer: Implement a data transformation layer that can handle different data formats and convert them to a common format.
Option B is the correct approach for handling data from multiple sources with varying formats. Implementing a data transformation layer allows the pipeline to process and convert different data formats to a common format, ensuring consistency and accuracy. Option A may not be feasible if the sources cannot or will not conform to a unified data model. Option C, while useful for reducing data duplication, may not address the need for data transformation. Option D limits the pipeline's flexibility and ability to process diverse data sources.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are working on a data pipeline that processes data from multiple sources with varying data formats. How would you approach the problem of data integration and transformation to ensure consistency and accuracy?
A
Create a unified data model that all sources must conform to before ingestion.
B
Implement a data transformation layer that can handle different data formats and convert them to a common format.
C
Use a data virtualization approach to query data from sources directly without integration.
D
Restrict the pipeline to only process data from sources with compatible formats.
No comments yet.