Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
Given a PySpark DataFrame named 'spark_df', write a code snippet that demonstrates how to perform a distributed groupby operation on a column named 'category' using Pandas API on Spark.
A
grouped = spark_df.groupby('category').collect()
B
grouped = spark_df.toPandasAPI().groupby('category').collect()
C
grouped = spark_df.groupby('category').toPandasAPI().collect()
D
grouped = spark_df.toPandasAPI().groupby('category').toSpark().collect()