Databricks Certified Associate Developer for Apache Spark

Get started today

Ultimate access to all questions.

Explanation:

The question asks for a 15% sample without replacement. In PySpark, DataFrame.sample() defaults to withReplacement=False if not specified. Option B correctly sets fraction=0.15 and omits withReplacement, thus using the default. Option A uses True (with replacement), which is incorrect. Option C uses sampleBy, which requires a col and a fractions dict. Option D's fraction is 0.10 (10%). Option E is missing the fraction parameter, causing an error. Therefore, only B is correct.

Explanation:

Comments (0)

No comments yet.

Which of the following code blocks returns a 15% sample of rows from the DataFrame `storesDF` without replacement?

Exam-Like

Last updated: April 28, 2026 at 14:02

storesDF.sample(True, fraction = 0.15)

21.9%

storesDF.sample(fraction = 0.15)

67.6%

storesDF.sampleBy(fraction = 0.15)

7.6%

storesDF.sample(fraction = 0.10)

1.9%

storesDF.sample()

1.0%

Databricks Certified Associate Developer for Apache Spark

Get started today

Comments (0)

Get started today

Comments (0)

Which of the following code blocks returns a 15% sample of rows from the DataFrame storesDF without replacement?

Which of the following code blocks returns a 15% sample of rows from the DataFrame `storesDF` without replacement?