
Ultimate access to all questions.
In the context of managing and querying time-series data within a Delta Lake environment, consider the following scenario: Your organization is building a data pipeline that processes high-volume time-series data. The pipeline must support frequent updates and queries, while ensuring data consistency and minimizing operational overhead. Given these requirements, which Delta Lake feature should you leverage to most effectively manage and query this time-series data? Choose the best option from the following:
A
Partition the Delta table by the timestamp column to enhance query performance, without considering the need for data updates.
B
Utilize Delta Lake's time travel feature to access historical data snapshots, focusing solely on data retrieval without addressing data modification needs.
C
Disable the transaction log to reduce logging overhead, sacrificing data consistency and atomicity for operational simplicity.
D
Implement Delta Lake's upsert feature to efficiently update or insert time-series data based on a unique timestamp, ensuring data consistency and minimizing the need for separate transactions.