Reddit

A data engineer has set up a Structured Streaming job to read from a table, aggregate the data, and then perform a streaming write into a new table. The code block used is as follows:

spark.table("sales")
.groupBy("store")
.agg(sum("sales").alias("sum_sales"))
.writeStream
.option("checkpointLocation", checkpointPath)
.outputMode("complete")
.______
.table("aggregatedSales")

spark.table("sales")
.groupBy("store")
.agg(sum("sales").alias("sum_sales"))
.writeStream
.option("checkpointLocation", checkpointPath)
.outputMode("complete")
.______
.table("aggregatedSales")

If the goal is to execute only a single micro-batch to process all available data, which line of code should fill in the blank?___

Real Exam

trigger(continuous="once")

10.2%

processingTime("once")

6.5%

trigger(processingTime="once")

27.4%

trigger(once=True)

52.1%

processingTime(1)

3.7%

Databricks Certified Data Engineer - Associate

Comments

Get started today