Microsoft Azure Data Engineer Associate - DP-203

Ultimate access to all questions.

In a data processing pipeline, you have identified a skew in the data distribution across different partitions. How would you handle this skew to ensure balanced processing and avoid hotspots in your distributed system?

Simulated

Increase the number of partitions to distribute the data more evenly.

0.0%

Implement a custom partitioning logic that redistributes the skewed data across existing partitions.

75.0%

Loading comments...

Use a salting technique to add a random element to the data keys to distribute the load more evenly.

25.0%