LeetQuiz Logo
Privacy Policy•contact@leetquiz.com
© 2025 LeetQuiz All rights reserved.
Databricks Certified Data Engineer - Professional

Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.


You are designing a PySpark application to process a large dataset for a financial analytics project. The dataset includes transaction records from the past year, and your application needs to perform various transformations to calculate monthly spending trends. Given the project's requirements for timely insights and the dataset's size, you aim to optimize the job's performance by minimizing data shuffling. Which of the following strategies would be the MOST effective to achieve this goal, and why? Choose one option.

Simulated



Powered ByGPT-5