
Explanation:
The question asks for operations that induce a shuffle and return a new DataFrame with updated partitions.
Thus, C and E are correct as they involve shuffling and produce new partitions.
Ultimate access to all questions.
No comments yet.
Which of the following operations will consistently produce a new DataFrame with repartitioned data from DataFrame storesDF by triggering a shuffle operation?
A
storesDF.coalesce()
B
storesDF.rdd.getNumPartitions()
C
storesDF.repartition()
D
storesDF.union()
E
storesDF.intersect(otherDF)