
Answer-first summary for fast verification
Answer: spark.table("sales")
## Explanation

The correct answer is **E. spark.table("sales")**.

### Why option E is correct

- `spark.table("sales")` is the standard PySpark method for accessing tables registered in the Spark catalog.
- It returns a DataFrame that can be used for data processing, transformations, and testing in PySpark.
- It works with Delta tables as well as any other table type registered in the catalog, so no Delta-specific call is needed.

### Why the other options are incorrect

- **A. SELECT * FROM sales**: This is SQL syntax, not PySpark code. You could wrap it as `spark.sql("SELECT * FROM sales")`, but the option as written is not valid Python.
- **B. There is no way to share data between PySpark and SQL**: Incorrect. PySpark and SQL share the same Spark catalog, so tables registered in one environment are visible in the other.
- **C. spark.sql("sales")**: `spark.sql()` expects a complete SQL statement; a bare table name is not a valid query.
- **D. spark.delta.table("sales")**: This is not a real PySpark API; `SparkSession` has no `delta` attribute. The Delta Lake Python API instead exposes `DeltaTable.forName(spark, "sales")`, which returns a `DeltaTable` object rather than a DataFrame.

### Key points

1. **PySpark-SQL integration**: PySpark and SQL share the same Spark catalog, allowing seamless data access between both environments.
2. **Delta table access**: Delta tables registered in the catalog can be read with standard PySpark methods.
3. **Best practice**: `spark.table("table_name")` is the most common and recommended way to access registered tables in PySpark.
Author: Keng Suppaseth
A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL. Which of the following commands could the data engineering team use to access sales in PySpark?
A. SELECT * FROM sales
B. There is no way to share data between PySpark and SQL.
C. spark.sql("sales")
D. spark.delta.table("sales")
E. spark.table("sales")