
A data engineering team has noticed that their Databricks SQL queries are running too slowly when they are submitted to a non-running SQL endpoint. The data engineering team wants this issue to be resolved.
Which of the following approaches can the team use to reduce the time it takes to return results in this scenario?
A. They can turn on the Serverless feature for the SQL endpoint and change the Spot Instance Policy to "Reliability Optimized."
B. They can turn on the Auto Stop feature for the SQL endpoint.
C. They can increase the cluster size of the SQL endpoint.
D. They can turn on the Serverless feature for the SQL endpoint.
E. They can increase the maximum bound of the SQL endpoint's scaling range.
Explanation:
When a SQL query is submitted to a non-running SQL endpoint, the endpoint must first provision and start its compute resources, and the query waits during this cold start. The slowness described in the question therefore comes from startup time, not from query execution on an endpoint that is already running.
Let's analyze each option:
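A. Turn on the Serverless feature and change the Spot Instance Policy to "Reliability Optimized" - This is incorrect because the Serverless feature alone addresses the cold start. The Spot Instance Policy governs whether a classic endpoint runs on spot or on-demand instances; it is not needed to reduce startup time.
B. Turn on the Auto Stop feature - This is incorrect because Auto Stop shuts the endpoint down after a period of inactivity, which makes it more likely that the next query arrives at a non-running endpoint.
C. Increase the cluster size of the SQL endpoint - This is incorrect because a larger cluster can speed up query execution once the endpoint is running, but it does not shorten the time needed to start the endpoint.
D. Turn on the Serverless feature for the SQL endpoint - CORRECT because serverless compute is drawn from resources that are already running, so the endpoint becomes available much faster than a classic endpoint that must provision its own cluster.
E. Increase the maximum bound of the SQL endpoint's scaling range - This is incorrect because the scaling range controls how many clusters the endpoint can add to handle concurrent queries; it has no effect on startup time.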
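The option-by-option analysis above can be sketched as a toy latency model in which time to first result is startup time plus execution time. Everything here is an assumption made for illustration: the `EndpointConfig` class, the function name, and all timing constants are hypothetical and are not measured Databricks figures. The model also assumes that cluster size affects only execution time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EndpointConfig:
    """Hypothetical stand-in for the settings named in the answer options."""
    serverless: bool = False
    cluster_size_factor: float = 1.0  # relative compute; 2.0 = twice the baseline
    max_clusters: int = 1             # upper bound of the scaling range
    auto_stop: bool = False


# Illustrative numbers only, not measured Databricks figures.
CLASSIC_STARTUP_S = 240.0
SERVERLESS_STARTUP_S = 5.0
BASE_EXECUTION_S = 20.0


def time_to_first_result(cfg: EndpointConfig) -> float:
    """Seconds until one query returns when the endpoint starts out stopped."""
    startup = SERVERLESS_STARTUP_S if cfg.serverless else CLASSIC_STARTUP_S
    execution = BASE_EXECUTION_S / cfg.cluster_size_factor
    # max_clusters and auto_stop are deliberately unused: in this model they
    # affect concurrency and idle shutdown, not a single cold-start query.
    return startup + execution


baseline = time_to_first_result(EndpointConfig())
options = {
    "B (auto stop)": EndpointConfig(auto_stop=True),
    "C (larger cluster)": EndpointConfig(cluster_size_factor=2.0),
    "D (serverless)": EndpointConfig(serverless=True),
    "E (higher max scaling)": EndpointConfig(max_clusters=4),
}
for label, cfg in options.items():
    print(f"{label}: {time_to_first_result(cfg):.0f}s (baseline {baseline:.0f}s)")
```

Under these assumed numbers, only the serverless configuration changes the dominant term (startup), which is why it is the only option that meaningfully reduces the wait.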
Key Concept: Serverless SQL endpoints in Databricks start quickly because their compute comes from a pool of resources that Databricks keeps ready, which largely removes the wait for cluster startup when a query is submitted to a non-running endpoint.
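As a rough sketch of how the Serverless setting might be applied programmatically, the snippet below only builds a request body and makes no network call. The field names (`enable_serverless_compute`, `warehouse_type`, `cluster_size`) reflect my understanding of the Databricks SQL Warehouses REST API, where SQL endpoints are now called SQL warehouses; verify them against the current API reference before use. The helper name and the endpoint name `team-endpoint` are hypothetical.

```python
import json


def serverless_edit_payload(name: str, cluster_size: str = "Small") -> dict:
    """Build a request body that asks for serverless compute.

    Assumption: the API expects warehouse_type "PRO" together with
    enable_serverless_compute set to true. Check the official reference.
    """
    return {
        "name": name,
        "cluster_size": cluster_size,
        "warehouse_type": "PRO",
        "enable_serverless_compute": True,
    }


# "team-endpoint" is a placeholder name used only for this example.
payload = serverless_edit_payload("team-endpoint")
print(json.dumps(payload, indent=2))
```

The body would then be sent to the warehouse edit route of a workspace with a suitable access token; that step is omitted here because it depends on the workspace URL and authentication setup.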