Google Professional Data Engineer

Get started today

Ultimate access to all questions.

Explanation:

Format-preserving encryption (FPE) with FFX in Cloud DLP is a strong choice for de-identifying PII like email addresses. FPE maintains the format of the data and ensures that the same input results in the same encrypted output consistently. This means the email fields in both datasets can be encrypted to the same value, allowing for accurate joins in BigQuery while keeping the actual email addresses hidden. Masking (Option A) would not preserve the uniqueness required for joins, and dynamic data masking (Options C and D) occurs within BigQuery, which does not satisfy the requirement of de-identifying data before loading into BigQuery.

Explanation:

Comments (0)

No comments yet.

Your company operates a data platform that continually ingests CSV file dumps containing booking and user profile data from upstream sources into Google Cloud Storage. The analyst team needs to perform a join operation on these datasets using the common email field for their analysis. However, it is crucial to ensure that personally identifiable information (PII) is not exposed to the analysts during this process. To achieve this, you must de-identify the email field in both datasets prior to loading them into BigQuery. What approach should you take?

Exam-Like

Last updated: May 6, 2026 at 14:02

Create a pipeline to de-identify the email field by using recordTransformations in Cloud Data Loss Prevention (Cloud DLP) with masking as the de-identification transformations type.
Load the booking and user profile data into a BigQuery table.

19.6%

Create a pipeline to de-identify the email field by using recordTransformations in Cloud DLP with format-preserving encryption with FFX as the de-identification transformation type.
Load the booking and user profile data into a BigQuery table.

45.1%

Load the CSV files from Cloud Storage into a BigQuery table, and enable dynamic data masking.
Create a policy tag with the email mask as the data masking rule.
Assign the policy to the email field in both tables.
Assign the Identity and Access Management bigquerydatapolicy.maskedReader role for the BigQuery tables to the analysts.

17.6%

Load the CSV files from Cloud Storage into a BigQuery table, and enable dynamic data masking.
Create a policy tag with the default masking value as the data masking rule.
Assign the policy to the email field in both tables.
Assign the Identity and Access Management bigquerydatapolicy.maskedReader role for the BigQuery tables to the analysts.

17.6%