LeetQuiz Logo
Privacy Policy•contact@leetquiz.com
© 2025 LeetQuiz All rights reserved.
Databricks Certified Data Engineer - Professional

Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.


In a PySpark application, you are tasked with optimizing the performance of a job that involves performing a series of transformations on a large dataset and subsequently joining it with another dataset. The second dataset is significantly smaller in size. Considering the constraints of cost, compliance, and scalability, which of the following strategies would you choose to ensure the most efficient execution of your job? Choose the best option and explain why it is the most suitable.

Simulated



Powered ByGPT-5