Explanation
In the Databricks Lakehouse (medallion) architecture, Bronze tables form the first layer, where raw data is ingested and stored in its original form. The Bronze layer typically contains:
- Raw, unprocessed data
- Data in its original format (JSON, CSV, etc.)
- Data that may need cleaning, transformation, or enrichment
Looking at the options:
- Option A: Aggregates cleaned data - Cleaned data lives in Silver tables, so this job reads from Silver (and typically writes to Gold), not from Bronze
- Option B: Queries aggregated data - This would use Gold tables for reporting/dashboards
- Option C: Ingests raw data - This is the process that CREATES Bronze tables; Bronze is its target, not its source
- Option D: Develops feature sets - This typically uses Silver or Gold tables, which hold cleaned, structured data
- Option E: Enriches data by parsing timestamps - This is exactly the type of transformation that would use Bronze tables as source, taking raw data and performing initial data quality/enrichment operations
Therefore, Option E is correct because parsing timestamps from raw data into human-readable format is a typical data enrichment operation that would use Bronze tables as the source.
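As a rough illustration of the kind of job Option E describes, the sketch below reads raw rows and adds a human-readable timestamp column. On Databricks this would normally be a Spark job reading a Bronze Delta table; plain Python stands in for it here so the example is self-contained. The row layout, the field names `event_id`, `event_ts`, and `event_time`, and the assumption that the raw timestamp is a string of epoch seconds are all hypothetical.

```python
from datetime import datetime, timezone

# Hypothetical Bronze rows: raw events stored as ingested, with the
# timestamp still an epoch-seconds string (an assumption for this sketch).
bronze_rows = [
    {"event_id": "a1", "event_ts": "1700000000"},
    {"event_id": "a2", "event_ts": "1700003600"},
]


def enrich_timestamp(row):
    """Return a copy of the row with a human-readable UTC timestamp added."""
    parsed = datetime.fromtimestamp(int(row["event_ts"]), tz=timezone.utc)
    # Keep the raw value and add the parsed one alongside it.
    return {**row, "event_time": parsed.strftime("%Y-%m-%d %H:%M:%S")}


# The enriched rows are what such a job would write to the next layer (Silver).
silver_rows = [enrich_timestamp(r) for r in bronze_rows]

for r in silver_rows:
    print(r["event_id"], r["event_time"])
```

The source of the job is the Bronze data and its output is a cleaner, more usable version of the same records, which is why this workload is the one that uses a Bronze table as its source.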