Databricks Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

In the context of incremental data processing within Azure Databricks, a data engineer is tasked with ensuring that data from a source table is efficiently combined with a target table without duplicating records. The solution must also maintain the existing schema of the target table and optimize for performance. Considering these requirements, which of the following statements best describes the primary benefit of using the MERGE command in this scenario? Choose the single best option.

Simulated

The MERGE command allows for the creation of a new table or the complete replacement of an existing table with a new schema, which is not required in this scenario.

2.5%

The MERGE command ensures that there is no duplication of data in the target table by only inserting new records from the source table.

Comments

Loading comments...

The MERGE command combines data from the source and target tables, writes the result to the target table while removing duplicates, and maintains the existing schema, making it the most suitable option for incremental data processing under the given constraints.

62.3%

The MERGE command overwrites all existing data in the target table with new data from the source table without altering the table schema, which does not meet the requirement of combining data without duplication.

9.9%