Databricks Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

Deep dive into the quiz with AI chat providers.

We prepare a focused prompt with your quiz and certificate details so each AI can offer a more tailored, in-depth explanation.

Which of the following queries is performing a streaming hop from raw data to a Bronze table?

Real Exam

Community

KKeng

Last updated: January 13, 2026 at 09:15

(spark.table("sales")
  .groupBy("store")
  .agg(sum("sales"))
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("complete")
  .table("newSales")
)

(spark.table("sales")
  .groupBy("store")
  .agg(sum("sales"))
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("complete")
  .table("newSales")
)

(spark.table("sales")
  .filter(col("units") > 0)
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("append")
  .table("newSales")
)

(spark.table("sales")
  .filter(col("units") > 0)
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("append")
  .table("newSales")
)

(spark.table("sales")
  .withColumn("avgPrice", col("sales") / col("units"))
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("append")
  .table("newSales")
)

(spark.table("sales")
  .withColumn("avgPrice", col("sales") / col("units"))
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("append")
  .table("newSales")
)

(spark.read.load(rawSalesLocation)
  .write
  .mode("append")
  .table("newSales")
)

(spark.read.load(rawSalesLocation)
  .write
  .mode("append")
  .table("newSales")
)

(spark.readStream.load(rawSalesLocation)
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("append")
  .table("newSales")
)

(spark.readStream.load(rawSalesLocation)
  .writeStream
  .option("checkpointLocation", checkpointPath)
  .outputMode("append")
  .table("newSales")
)

Explanation:

Explanation

Option E is the correct answer because it demonstrates a streaming hop from raw data to a Bronze table. Here's why:

Key Characteristics of a Streaming Hop:

Reads from raw data source - Uses spark.readStream.load(rawSalesLocation) which reads streaming data from raw files
Writes to a table - Uses .table("newSales") to write to a Delta table (Bronze layer)
Streaming processing - Uses readStream and writeStream for continuous data ingestion
Checkpointing - Includes .option("checkpointLocation", checkpointPath) for fault tolerance
Append mode - Uses .outputMode("append") appropriate for streaming ingestion

Why Other Options Are Incorrect:

Option A, B, C: These read from an existing table (spark.table("sales")) rather than raw data, so they're not performing the initial ingestion from raw to Bronze.
Option D: This uses batch processing (spark.read.load) instead of streaming, so it's not a streaming hop.

Bronze Table Context:

In Databricks Medallion Architecture:

Bronze tables store raw, unprocessed data as-is from source systems
Streaming hops are used for continuous ingestion from raw data sources
The Bronze layer serves as the landing zone for all incoming data before any transformations

Option E correctly implements this pattern by streaming data from raw files directly into a Bronze table.

Powered ByGPT-5.2

Comments

Loading comments...