Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
Consider a scenario where you need to upsert batch data into a Delta Lake. The data includes customer profiles that are updated frequently. How would you structure your upsert operation to ensure data consistency and minimize performance impact?
A
Use a MERGE statement with a unique identifier in the Delta Lake to perform the upsert operation.
B
Delete the existing data and insert the new data each time to ensure consistency.
C
Perform the upsert operation using a Python script that checks for duplicates before inserting.
D
Manually compare the new data with the existing data in the Delta Lake and update records one by one.