
Ultimate access to all questions.
In the context of training a machine learning model with data stored in BigQuery that includes Personally Identifiable Information (PII), and considering all columns are crucial for the model's performance, what are the best practices to ensure privacy without diminishing the data's utility? Choose two correct options.
A
Utilize the Cloud Data Loss Prevention (DLP) API to identify sensitive data, then apply Dataflow with the DLP API to encrypt sensitive values using Format Preserving Encryption.
B
Before training, filter out columns containing sensitive data in BigQuery and establish an authorized view to restrict access to sensitive information.
C
Employ Dataflow to extract columns with sensitive data from BigQuery and then randomly shuffle the values within each sensitive column.
D
Scan for sensitive data using the Cloud Data Loss Prevention (DLP) API, then use Dataflow to replace all sensitive data with AES-256 encryption, including a salt.
E
Implement both Format Preserving Encryption for columns used in model training and AES-256 encryption for columns not directly used in training but required for compliance.