
Answer-first summary for fast verification
Answer: B
result = dbutils.data.Summary(df, 'age')
print(result['mean'], result['median'], result['stddev'])
The intended approach is to use Databricks dbutils data summaries rather than plain Spark aggregations, and option B is the only choice that does so: it calls the summary utility on the DataFrame with the column name, then reads the statistics from the result by key. Note, however, that the utility documented by Databricks is dbutils.data.summarize(df), which renders an interactive summary of the DataFrame (count, mean, percentiles including the median, standard deviation, and more) in the notebook output rather than returning a dictionary; the Summary(df, 'age') spelling here follows the quiz's own notation. Option A is incorrect for this question because it computes the statistics with built-in Spark SQL functions instead of dbutils data summaries. Option C is incorrect because df.describe() returns a summary DataFrame, not a nested dictionary that can be indexed as shown, and describe() does not report the median (df.summary() is needed for percentiles). Option D is incorrect because it uses the agg method instead of dbutils data summaries.
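As a sanity check on what the three statistics mean, they can be computed without Spark at all. The sketch below uses only the Python standard library; the ages list is made up for illustration and is not part of the question:

```python
import statistics

# Hypothetical sample of 'age' values, purely for illustration.
ages = [23, 31, 31, 40, 52]

mean_age = statistics.mean(ages)      # arithmetic mean
median_age = statistics.median(ages)  # middle value of the sorted data
stddev_age = statistics.stdev(ages)   # sample standard deviation

print(mean_age, median_age, round(stddev_age, 4))
```

A quick cross-check like this is useful when validating that a Spark aggregation or a notebook summary is reporting the statistic you expect (e.g., sample vs. population standard deviation).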
Author: LeetQuiz Editorial Team
You are given a Spark DataFrame 'df' with a numerical column 'age'. Write a code snippet that computes the mean, median, and standard deviation of the 'age' column using dbutils data summaries, and explain the steps involved.
A
from pyspark.sql.functions import mean, median, stddev
result = df.select(mean('age'), median('age'), stddev('age'))
result.show()
B
result = dbutils.data.Summary(df, 'age')
print(result['mean'], result['median'], result['stddev'])
C
result = df.describe()
print(result['age']['mean'], result['age']['50%'], result['age']['stddev'])
D
result = df.agg(mean('age'), median('age'), stddev('age'))
result.show()