
In which of the following file formats is data from Delta Lake tables primarily stored?
A. Delta
B. CSV
C. Parquet
D. JSON
E. A proprietary, optimized format specific to Databricks
Explanation:
Delta Lake tables primarily store data in Parquet format. Here's why:
Parquet is the underlying storage format: Delta Lake uses Parquet files as the base storage format for data. The "Delta" aspect refers to the transaction log and metadata layer that sits on top of Parquet files, not the underlying data storage format.
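As a rough illustration of that split, the sketch below sorts the files of a Delta table directory into data files and transaction-log files. The helper name `classify_table_files`, the `events` table, and the file names are hypothetical and exist only for this example; the sketch assumes the usual layout in which the log lives under a `_delta_log` directory next to the Parquet data files.

```python
from pathlib import PurePosixPath


def classify_table_files(paths):
    """Group the files of a Delta table directory by role (illustrative helper)."""
    groups = {"data": [], "log": [], "other": []}
    for p in paths:
        path = PurePosixPath(p)
        if "_delta_log" in path.parts:
            # Anything under _delta_log belongs to the transaction log,
            # regardless of its file extension.
            groups["log"].append(p)
        elif path.suffix == ".parquet":
            # The table's rows are stored in Parquet data files.
            groups["data"].append(p)
        else:
            groups["other"].append(p)
    return groups


# Hypothetical listing of a small Delta table; the file names are made up.
listing = [
    "events/_delta_log/00000000000000000000.json",
    "events/_delta_log/00000000000000000001.json",
    "events/part-00000-aaaa.snappy.parquet",
    "events/part-00001-bbbb.snappy.parquet",
]

groups = classify_table_files(listing)
for role, files in groups.items():
    print(role, len(files))
```

In this made-up listing, every file that holds table data ends in `.parquet`; the remaining files belong to the log, which is the layer the name "Delta" refers to.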
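Delta Lake architecture: A Delta table consists of Parquet data files plus a transaction log (the _delta_log directory) that records each commit and the table's metadata.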
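Benefits of Parquet: Parquet is a columnar format, so it compresses well and lets queries read only the columns they need.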
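Why not the other options: "Delta" (A) names the table format and its transaction log, not a data file format. CSV (B) and JSON (D) are not used to store the table data, although the transaction log's commit files are JSON. Option E is wrong because Delta Lake is an open-source project and its data files are standard Parquet, not a format proprietary to Databricks.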
Key takeaway: Delta Lake enhances Parquet files with transaction capabilities and metadata management, but the actual data is stored in Parquet format.