
When loading CSV files from Cloud Storage into BigQuery, you encounter data quality issues such as mixed data types in the same column and inconsistently formatted values like phone numbers or addresses. What is the best approach to ensure data quality and perform the necessary cleansing and transformation in your data pipeline?
A
Convert the CSV files to a self-describing data format, such as Avro, before loading the data into BigQuery using Data Fusion.
B
Load the CSV files into a staging table with the desired schema, perform the transformations with SQL, and then write the results to the final destination table.
C
Create a table with the desired schema, load the CSV files into the table, and perform the transformations in place using SQL.
D
Use Data Fusion to transform the data before loading it into BigQuery.
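
For illustration only, the staging-table pattern described in option B might look like the sketch below. The table names (`mydataset.customers_staging`, `mydataset.customers`) and column names are hypothetical, and the sketch assumes every staging column was loaded as STRING so that malformed values do not fail the load. The SQL uses BigQuery's `SAFE_CAST` and `REGEXP_REPLACE`; the Python helpers are only rough local analogues of that logic so it can be checked without a BigQuery project.

```python
import re
from typing import Optional

# Hypothetical transformation from a staging table to the final table.
# Rows whose customer_id cannot be cast to INT64 are dropped.
TRANSFORM_SQL = """
CREATE OR REPLACE TABLE mydataset.customers AS
SELECT
  SAFE_CAST(customer_id AS INT64) AS customer_id,
  REGEXP_REPLACE(phone, r'[^0-9]', '') AS phone,
  TRIM(address) AS address
FROM mydataset.customers_staging
WHERE SAFE_CAST(customer_id AS INT64) IS NOT NULL
"""


def safe_int(value) -> Optional[int]:
    """Rough analogue of SAFE_CAST(x AS INT64): None instead of an error."""
    try:
        return int(value)
    except (ValueError, TypeError):
        return None


def normalize_phone(value: str) -> str:
    """Rough analogue of REGEXP_REPLACE(phone, r'[^0-9]', ''): keep digits only."""
    return re.sub(r"[^0-9]", "", value)


def clean_row(row: dict) -> Optional[dict]:
    """Apply the same cleansing to one staging row; None means the row is dropped."""
    customer_id = safe_int(row["customer_id"])
    if customer_id is None:
        return None
    return {
        "customer_id": customer_id,
        "phone": normalize_phone(row["phone"]),
        "address": row["address"].strip(),
    }
```

In this sketch the raw CSV values stay untouched in the staging table, so the transformation can be rerun or adjusted without reloading the files from Cloud Storage.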