
Ultimate access to all questions.
A data engineer wants to create a relational object by pulling data from two tables. The relational object does not need to be used by other data engineers in other sessions. In order to save on storage costs, the data engineer wants to avoid copying and storing physical data.
Which of the following relational objects should the data engineer create?
A
Spark SQL Table
B
View
C
Database
D
Temporary view
E
Delta Table
Explanation:
The correct answer is D. Temporary view.
Here's why:
Avoids copying and storing physical data: Temporary views are logical constructs that don't store physical data. They are defined by a query that references existing tables, so no additional storage is required.
Not needed by other data engineers in other sessions: Temporary views are session-scoped, meaning they only exist for the duration of the current Spark session and are not visible to other users or sessions.
Comparison with other options:
Key points:
CREATE TEMPORARY VIEW view_name AS SELECT ... FROM table1 JOIN table2 ...