
Answer-first summary for fast verification
Answer: Silver tables contain less data than Bronze tables.
## Explanation In the Databricks Lakehouse architecture, data flows through different quality layers: 1. **Bronze Layer**: Raw, unprocessed data ingested from source systems. This contains all data including duplicates, errors, and inconsistencies. 2. **Silver Layer**: Cleaned, validated, and enriched data. This layer applies data quality checks, deduplication, and basic transformations. 3. **Gold Layer**: Aggregated, business-ready data for reporting and analytics. **Why option D is correct**: Silver tables typically contain **less data** than Bronze tables because: - Duplicate records are removed - Invalid or erroneous records are filtered out - Data is deduplicated and cleaned - Only relevant data is retained after quality checks **Why other options are incorrect**: - **A**: False - Silver tables contain MORE refined, cleaner data than Bronze - **B**: False - Aggregation typically happens in the Gold layer, not Silver - **C**: False - Silver tables contain LESS data, not more, due to cleaning and deduplication The Silver layer represents the 'single source of truth' with clean, validated data, which is always a subset of the raw Bronze data after quality processing.
Author: Keng Suppaseth
Ultimate access to all questions.
Which statement regarding the relationship between Silver tables and Bronze tables is always true?
A
Silver tables contain a less refined, less clean view of data than Bronze data.
B
Silver tables contain aggregates while Bronze data is unaggregated.
C
Silver tables contain more data than Bronze tables.
D
Silver tables contain less data than Bronze tables.
No comments yet.