
Ultimate access to all questions.
Which statement regarding the relationship between Silver tables and Bronze tables is always true?
A
Silver tables contain a less refined, less clean view of data than Bronze data.
B
Silver tables contain aggregates while Bronze data is unaggregated.
C
Silver tables contain more data than Bronze tables.
D
Silver tables contain less data than Bronze tables.
Explanation:
In the Databricks Lakehouse architecture, data flows through different quality layers:
Bronze Layer: Raw, unprocessed data ingested from source systems. This contains all data including duplicates, errors, and inconsistencies.
Silver Layer: Cleaned, validated, and enriched data. This layer applies data quality checks, deduplication, and basic transformations.
Gold Layer: Aggregated, business-ready data for reporting and analytics.
Why option D is correct: Silver tables typically contain less data than Bronze tables because:
Why other options are incorrect:
The Silver layer represents the 'single source of truth' with clean, validated data, which is always a subset of the raw Bronze data after quality processing.