
Answer-first summary for fast verification
Answer:
from pyspark.sql.functions import covar_samp
result = df.select(covar_samp('price', 'quantity'))
result.show()
The correct approach is to use the 'covar_samp' aggregate function from the 'pyspark.sql.functions' module (PySpark has no function named 'cov' there; the module provides 'covar_samp' for sample covariance and 'covar_pop' for population covariance, and 'df.stat.cov(col1, col2)' is an equivalent shortcut that returns the sample covariance as a float). Option A does this correctly: import the aggregate, apply it to the two columns with 'select', and display the single-row result. Option B is incorrect because 'df.stat.corr' computes the Pearson correlation coefficient, not the covariance. Option C is incorrect because it computes the covariance between a derived column ('price' * 'quantity') and 'price', which is not the covariance of 'price' and 'quantity' that was asked for (and 'cov' is not a real function in 'pyspark.sql.functions'). Option D is incorrect as written: it relies on the nonexistent 'cov' function, and the empty 'groupBy()' is unnecessary anyway, since an aggregate applied via 'select' or 'agg' already operates over the whole DataFrame.
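The sample covariance that 'covar_samp' (and 'df.stat.cov') returns follows the standard formula sum((x_i - mean_x) * (y_i - mean_y)) / (n - 1). A minimal pure-Python sketch of that formula, useful for sanity-checking the Spark result on small data (the helper name 'sample_cov' and the example values are illustrative, not part of PySpark):

```python
# Pure-Python sample covariance: the same quantity PySpark's
# covar_samp / df.stat.cov computes (note the n - 1 denominator).
def sample_cov(xs, ys):
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two equal-length sequences with n >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)

price = [10.0, 12.0, 14.0]
quantity = [3.0, 2.0, 1.0]
print(sample_cov(price, quantity))  # -2.0
```

On a Spark DataFrame built from these same rows, df.stat.cov('price', 'quantity') should return the same value.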
Author: LeetQuiz Editorial Team
You are given a Spark DataFrame 'df' with a numerical column 'price'. Write a code snippet that computes the covariance between the 'price' column and another numerical column 'quantity', and explain the steps involved.
A
from pyspark.sql.functions import covar_samp
result = df.select(covar_samp('price', 'quantity'))
result.show()
B
result = df.stat.corr('price', 'quantity')
print(result)
C
result = df.withColumn('price_quantity', df.price * df.quantity)
result = result.select(cov('price_quantity', 'price'))
result.show()
D
result = df.groupBy().agg(cov('price', 'quantity'))
result.show()