In a Databricks environment, you are working with a Delta table named customers that contains a column called customer_data. This column is of type STRING and contains JSON text with customer information. Inside each JSON object, there is an address field, which is itself a nested JSON object containing city and zipcode fields.

You need to write a Spark SQL query to extract the city and zipcode values from the address object and return them as separate columns. The solution must be efficient, scalable, and use correct Spark SQL syntax for working with JSON strings.

Which of the following queries correctly achieves this?_

Simulated

A

SELECT customer_data.address.city, customer_data.address.zipcode FROM customers

21.7%

B

SELECT city, zipcode FROM customers CROSS JOIN JSON_TABLE(customer_data, '$.address')

7.7%

C

SELECT customer_data['address']['city'], customer_data['address']['zipcode'] FROM customers

11.7%

D

SELECT get_json_object(customer_data, ' $.address.city') AS city, get_json_object(customer_data, '$ .address.zipcode') AS zipcode FROM customers

58.9%

Powered ByGPT-5.2

Databricks Certified Data Engineer - Associate