
Ultimate access to all questions.
Which of the following data workloads will utilize a Bronze table as its source?
A
A job that aggregates cleaned data to create standard summary statistics
B
A job that queries aggregated data to publish key insights into a dashboard
C
A job that ingests raw data from a streaming source into the Lakehouse
D
A job that develops a feature set for a machine learning application
E
A job that enriches data by parsing its timestamps into a human-readable format
Explanation:
In the Databricks Lakehouse architecture, the medallion architecture consists of three layers:
Looking at the options:
Option E is correct because parsing timestamps into human-readable format is a data cleaning/enrichment task that takes raw data from Bronze tables and transforms it into a more usable format, which is characteristic of Silver layer processing.
Bronze tables serve as the source for data cleaning, validation, and enrichment tasks that produce Silver tables. The timestamp parsing operation is exactly the type of data quality improvement that happens when moving from Bronze to Silver layer.