Google Professional Data Engineer

Get started today

Ultimate access to all questions.

After migrating an ETL job to BigQuery, how can you compare its output with the original job's output when the tables lack a primary key for joining, but you have the original job's output table?

Real Exam

Use a Dataproc cluster and the BigQuery Hadoop connector to read the data from each table and calculate a hash from non-timestamp columns of the table after sorting. Compare the hashes of each table.

76.9%

Select random samples from the tables using the RAND() function and compare the samples.

Comments

Loading comments...

Create stratified random samples using the OVER() function and compare equivalent samples from each table.

7.7%