Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.

Explanation:

The code snippet provided demonstrates a batch processing operation where data from three tables ('accounts', 'orders', 'items') is joined and then written to a new table 'enriched_itemized_orders_by_account' in overwrite mode. This means that every time the code is executed, the entire content of the 'enriched_itemized_orders_by_account' table is replaced with the current results of the join operations, using the latest data from the source tables. The correct option is B because it accurately describes this behavior. Options A, C, D, and E describe scenarios involving incremental updates, primary key-based updates, or lazy evaluation, none of which are applicable given the use of overwrite mode and the absence of any incremental processing logic in the code.

Explanation:

Comments (0)

No comments yet.

accountDF = spark.table("accounts")
orderDF = spark.table("orders")
itemDF = spark.table("items")
orderWithItemDF = (orderDF.join(
    itemDF,
    orderDF.itemID == itemDF.itemID)
    .select(
        orderDF.accountID, 
        orderDF.itemID,
        itemDF.itemName)
)
finalDF = (accountDF.join(
    orderWithItemDF,
    accountDF.accountID == orderWithItemDF.accountID)
    .select(
        orderWithItemDF["*"],
        accountDF.city)
)
(finalDF.write
    .mode("overwrite")
    .table("enriched_itemized_orders_by_account"))

accountDF = spark.table("accounts")
orderDF = spark.table("orders")
itemDF = spark.table("items")
orderWithItemDF = (orderDF.join(
    itemDF,
    orderDF.itemID == itemDF.itemID)
    .select(
        orderDF.accountID, 
        orderDF.itemID,
        itemDF.itemName)
)
finalDF = (accountDF.join(
    orderWithItemDF,
    accountDF.accountID == orderWithItemDF.accountID)
    .select(
        orderWithItemDF["*"],
        accountDF.city)
)
(finalDF.write
    .mode("overwrite")
    .table("enriched_itemized_orders_by_account"))

Question
Assuming this code produces logically correct results and the source tables have been deduplicated and validated, what will happen when this code is executed?

Exam-Like

Last updated: March 31, 2026 at 14:05

A batch job will update the enriched_itemized_orders_by_account table, replacing only those rows that have different values than the current version of the table, using accountID as the primary key.

11.5%

The enriched_itemized_orders_by_account table will be overwritten using the current valid version of data in each of the three tables referenced in the join logic.

67.9%