
Ultimate access to all questions.
A data engineering team has noticed that their Databricks SQL queries are running too slowly when they are submitted to a non-running SQL endpoint. The data engineering team wants this issue to be resolved. Which of the following approaches can the team use to reduce the time it takes to return results in this scenario?
A
They can turn on the Serverless feature for the SQL endpoint and change the Spot Instance Policy to "Reliability Optimized."
B
They can turn on the Auto Stop feature for the SQL endpoint.
C
They can increase the cluster size of the SQL endpoint.
D
They can turn on the Serverless feature for the SQL endpoint.
E
They can increase the maximum bound of the SQL endpoint's scaling range
Explanation:
When Databricks SQL queries are running too slowly when submitted to a non-running SQL endpoint, the main issue is that the endpoint needs to start up (cold start) before processing queries. This startup time contributes significantly to the overall query execution time.
Let's analyze each option:
A. Turn on Serverless feature and change Spot Instance Policy to "Reliability Optimized"
B. Turn on the Auto Stop feature for the SQL endpoint
C. Increase the cluster size of the SQL endpoint (CORRECT)
D. Turn on the Serverless feature for the SQL endpoint
E. Increase the maximum bound of the SQL endpoint's scaling range
Key Insight: The question specifically mentions queries are slow when submitted to a non-running SQL endpoint. While cold start is part of the problem, increasing cluster size provides more compute resources that can process queries faster once the endpoint is running, which is the most direct solution to improve query performance.