Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
For a data lakehouse designed to maintain immutable historical records of rapidly changing data, which strategy ensures efficient query performance and data integrity across versioned records?
A
Store each version as a separate file in a partitioned directory structure, leveraging metastore version annotations for query access.
B
Implement a branching data model within the lakehouse, creating separate branches for each version and merging them for analysis.
C
Utilize Delta Lake’s MERGE feature to upsert changes into a single versioned table, partitioned by the record timestamp.
D
Use Delta Lake time travel capabilities to access historical versions, optimizing current version access via partitioning and z-ordering.