Databricks Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

Deep dive into the quiz with AI chat providers.

We prepare a focused prompt with your quiz and certificate details so each AI can offer a more tailored, in-depth explanation.

A data engineer has joined an existing project and they see the following query in the project repository:

CREATE STREAMING LIVE TABLE loyal_customers AS
SELECT customer_id -
FROM STREAM(LIVE.customers)
WHERE loyalty_level = 'high';

CREATE STREAMING LIVE TABLE loyal_customers AS
SELECT customer_id -
FROM STREAM(LIVE.customers)
WHERE loyalty_level = 'high';

Which of the following describes why the STREAM function is included in the query?

Real Exam

Community

KKeng

Last updated: January 13, 2026 at 09:15

The STREAM function is not needed and will cause an error.

The table being created is a live table.

The customers table is a streaming live table.

The customers table is a reference to a Structured Streaming query on a PySpark DataFrame.

The data in the customers table has been updated since its last run.

Explanation:

The STREAM function is used to indicate that the source table (LIVE.customers) is a streaming live table. In Databricks SQL, when creating a streaming live table that reads from another streaming source, you need to use the STREAM() function to specify that the source is a streaming table. This allows the new streaming table to incrementally process data from the source streaming table.

Key points:

The query creates a STREAMING LIVE TABLE called loyal_customers
It reads from STREAM(LIVE.customers) - the STREAM() function wraps the source table
This indicates that LIVE.customers is itself a streaming table
Without the STREAM() function, the query would treat LIVE.customers as a static/batch table
This is documented in Databricks documentation for loading data into streaming tables

Powered ByGPT-5.2

Comments

Loading comments...