
Ultimate access to all questions.
A data engineer wants to schedule their Databricks SQL dashboard to refresh every hour, but they only want the associated SQL endpoint to be running when it is necessary. The dashboard has multiple queries on multiple datasets associated with it. The data that feeds the dashboard is automatically processed using a Databricks Job. Which of the following approaches can the data engineer use to minimize the total running time of the SQL endpoint used in the refresh schedule of their dashboard?
A
They can turn on the Auto Stop feature for the SQL endpoint.
B
They can ensure the dashboard's SQL endpoint is not one of the included query's SQL endpoint.
C
They can reduce the cluster size of the SQL endpoint.
D
They can ensure the dashboard's SQL endpoint matches each of the queries' SQL endpoints.
E
They can set up the dashboard's SQL endpoint to be serverless.
Explanation:
The correct answer is A because:
Auto Stop Feature: This feature automatically stops the SQL endpoint after a period of inactivity, which directly addresses the requirement to "only want the associated SQL endpoint to be running when it is necessary."
Why other options are incorrect:
Key Insight: The Auto Stop feature is specifically designed to stop SQL endpoints when they're not actively processing queries, which aligns perfectly with the requirement to minimize total running time while maintaining hourly refresh capability.