Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
After migrating an ETL job to BigQuery, how can you compare its output with the original job's output when the tables lack a primary key for joining, but you have the original job's output table?
A
Use a Dataproc cluster and the BigQuery Hadoop connector to read the data from each table and calculate a hash from non-timestamp columns of the table after sorting. Compare the hashes of each table.
B
Select random samples from the tables using the RAND() function and compare the samples.
C
Select random samples from the tables using the HASH() function and compare the samples.
D
Create stratified random samples using the OVER() function and compare equivalent samples from each table.