
Ultimate access to all questions.
A data warehouse team is worried about the potential for poor quality controls in some data sources and wishes to avoid importing incorrect or invalid data into the data warehouse. What initial step could they take to assess the extent of the issue before developing ETL code?
A
Load all source data into a data lake and then proceed to load it into the data warehouse.
B
Request that administrators of the source systems generate a data quality verification prior to exporting the data.
C
Conduct a data quality assessment on the source data post-extraction. This assessment should include checks for value ranges, value distributions, counts of invalid and missing values, among other source data checks.
D
Import the data directly into the data warehouse and record any records that do not pass integrity or consistency checks.