
A data engineer has a Python variable table_name that they would like to use in a SQL query. They want to construct a Python code block that will run the query using table_name. They have the following incomplete code block:

____(f"SELECT customer_id, spend FROM {table_name}")
Which of the following can be used to fill in the blank to successfully complete the task?
A
spark.delta.sql
B
spark.delta.table
C
spark.table
D
dbutils.sql
E
spark.sql
Explanation:
The correct answer is E (spark.sql).

spark.sql() is the primary method in PySpark for executing SQL queries from Python in Databricks. Because the query is passed as a string, an f-string can interpolate the table_name variable directly:

spark.sql(f"SELECT customer_id, spend FROM {table_name}")

Why the other options are incorrect:
- spark.delta.sql and spark.delta.table do not exist in the PySpark API.
- spark.table accepts a table name and returns it as a DataFrame; it does not execute an arbitrary SQL query.
- dbutils does not have a .sql method; SQL execution from Python is done through spark.sql().

Key Concept: In Databricks, spark.sql() is the standard way to execute SQL queries from Python code, and it supports string interpolation using f-strings or other string formatting methods.
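To make the pattern concrete, here is a minimal sketch of building the query with an f-string; the table name "customers" is an assumed sample value, and the spark.sql call assumes an active SparkSession (e.g. the spark object provided in a Databricks notebook):

```python
table_name = "customers"  # assumed sample table name

# The f-string interpolates the variable into the SQL text
query = f"SELECT customer_id, spend FROM {table_name}"
print(query)  # SELECT customer_id, spend FROM customers

# In a Databricks notebook, where `spark` is a ready-made SparkSession:
# df = spark.sql(query)
# df.show()
```

Note that f-string interpolation builds the SQL text before spark.sql() ever sees it, so the same approach works with any string-formatting method (str.format, concatenation, etc.).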