Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
In a Databricks notebook, a data engineer is tasked with normalizing text data by converting all strings to lowercase and removing spaces. What is the most efficient method to achieve this within Databricks?
A
Manually edit each string in the dataset before loading it into Databricks.
B
Export the dataset, process it with a script outside of Databricks, and then re-import the cleaned data.
C
Utilize a Databricks built-in function for string normalization in a Spark SQL query.
D
Implement a UDF in Databricks that lowercases and trims spaces from strings, applying it via a Spark SQL query.