Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.

In a multi-tenant Spark environment where each tenant's data is processed in isolation but resides within the same cluster, what is the best approach to optimize data aggregation queries for fairness and efficiency across tenants?

Real Exam

Use view-based access controls to isolate tenant data and apply CACHE directives to frequently aggregated datasets for each tenant, ensuring efficient reuse of computation results.

20.2%

Implement custom weighted fair scheduler pools for each tenant and assign Spark jobs to these pools based on the tenant's priority.

Comments

Loading comments...

Utilize dynamic resource allocation to adjust resource distribution based on the current load of each tenant's queries.

24.4%