
Ultimate access to all questions.
A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when many members of the team are running small queries simultaneously. They ask the data engineering team for help. The data engineering team notices that each of the team's queries uses the same SQL endpoint.
Which approach can the data engineering team use to improve the latency of the team's queries?
A
They can increase the cluster size of the SQL endpoint.
B
They can increase the maximum bound of the SQL endpoint's scaling range.
C
They can turn on the Auto Stop feature for the SQL endpoint.
D
They can turn on the Serverless feature for the SQL endpoint.
Explanation:
When many team members are running small queries simultaneously on the same SQL endpoint, the issue is likely related to concurrency limitations. SQL endpoints in Databricks have scaling ranges that determine how many clusters can be spawned to handle concurrent queries.
Let's analyze each option:
A. They can increase the cluster size of the SQL endpoint.
B. They can increase the maximum bound of the SQL endpoint's scaling range. ✅ CORRECT
C. They can turn on the Auto Stop feature for the SQL endpoint.
D. They can turn on the Serverless feature for the SQL endpoint.
Key Concept: SQL endpoint scaling ranges control how many clusters can be created to handle concurrent workloads. When many users run queries simultaneously, increasing the maximum bound allows more clusters to be created, reducing contention and improving query latency.