
Ultimate access to all questions.
In the context of Azure Databricks Lakehouse architecture, Change Data Feed (CDF) plays a pivotal role in managing data changes efficiently. Considering a scenario where a multinational corporation requires real-time analytics on customer transactions that are frequently updated and deleted across different regions, which of the following best describes how CDF addresses the challenges of propagating updates and deletes, and why it would be the optimal solution in this scenario? (Choose one correct answer)
A
CDF does not support real-time data changes, making it unsuitable for scenarios requiring immediate analytics on updated or deleted data.
B
CDF captures and propagates only the metadata of changes without affecting the actual data, thus ensuring data remains static for analytics.
C
CDF creates a full snapshot of the data for every change, significantly increasing storage costs and complexity without providing real-time change propagation.
D
CDF efficiently captures and propagates row-level changes to the data, enabling real-time analytics and reporting without the need for full data scans or complex ETL processes.