Describe a scenario where you would use `repartition` over `coalesce` in PySpark. Explain the reasoning behind your choice and the impact on data shuffling and performance.

Databricks Certified Data Engineer - Professional