
Answer-first summary for fast verification
Answer: The identified records will be removed from the customers table, but they will remain accessible in the table history until a VACUUM command is executed.
Correct Delete requests, or requests to be forgotten, involve the deletion of user data that constitutes Personally Identifiable Information (PII), such as names and email addresses. Due to the implementation of Delta Lake tables' time travel feature, deleted data remains in older versions of the dataset. It's important to note that deleting data does not remove the data files from the table directory. Instead, it generates a copy of the affected files excluding the deleted records. To finalize these deletions, executing VACUUM commands on the customers table is necessary. Reference: [Implementing the GDPR Right to Be Forgotten in Delta Lake](https://www.databricks.com/blog/2022/03/23/implementing-the-gdpr-right-to-be-forgotten-in-delta-lake.html)
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
What happens when the following query is executed to process customers' requests to be forgotten?
DELETE FROM customers
WHERE customer_id IN
(SELECT customer_id FROM delete_requests)
DELETE FROM customers
WHERE customer_id IN
(SELECT customer_id FROM delete_requests)
A
The identified records will be removed from the customers table, and their associated data files will be permanently deleted from the table directory.
B
The identified records will be removed from both the customers and delete_requests tables, and their associated data files will be permanently deleted from the tables' directories.
C
The identified records will be removed from the customers table, but they will remain accessible in the table history until a VACUUM command is executed.
D
The identified records will be removed from both the customers and delete_requests tables, but they will remain accessible in the table history until VACUUM commands are executed.
E
The identified records will be removed from the customers table, but they will remain accessible in the table history until the status of the requests in the delete_requests table is updated.