Databricks Certified Data Engineer - Professional

Ultimate access to all questions.

The data engineering team maintains a Silver table named 'sales_cleaned' with sales data appended in near real-time. They aim to create a Gold-layer entity from 'sales_cleaned' to compute the year-to-date (YTD) sales amount, featuring the schema: country_code STRING, category STRING, ytd_total_sales FLOAT, updated TIMESTAMP. The metrics need daily recalculation but are queried frequently by various business teams, prompting a need to minimize costs and latency. Which solution best fits these requirements?

Real Exam

Define the new entity as a global temporary view for shared computing resources among notebooks or jobs.

14.9%

Set up a nightly batch job to recalculate metrics and overwrite them in a table with each update.

28.5%

Loading comments...