Google Professional Data Engineer

Get started today

Ultimate access to all questions.

After migrating an ETL job to BigQuery, how can you compare its output with the original job's output when the tables lack a primary key for joining, but you have the original job's output table?

Real Exam

Use the RAND() function to select random samples from each table and compare these samples.

0.0%

Employ the HASH() function to select random samples from the tables and compare these samples.

16.7%

Comments

Loading comments...

Utilize a Dataproc cluster with the BigQuery Hadoop connector to read data from each table, sort the data, and calculate a hash from non-timestamp columns for comparison.

Create stratified random samples using the OVER() function and compare equivalent samples from each table.