For a large-scale data lakehouse with petabytes of data, what is the most efficient method to implement data masking for Personally Identifiable Information (PII) across multiple datasets to comply with privacy regulations?
A
Use an on-the-fly encryption model in which data is decrypted and masked only when unauthorized users access it.
B
Apply real-time data masking at the query level, using dynamic data masking capabilities of the lakehouse platform.
C
Implement column-level security, restricting access to sensitive columns and providing masked views to unauthorized users.
D
Perform batch data masking operations during off-peak hours, rewriting datasets with masked versions of sensitive fields.