Databricks Certified Machine Learning - Associate

Ultimate access to all questions.

In a big data environment, you are tasked with implementing a UDF that processes a very large dataset. Which type of Pandas UDF would you prefer and why?

Simulated

Scalar UDF because they are simpler to implement.

12.5%

Iterator UDF because they can handle large datasets more efficiently by processing data in chunks.

70.8%

Loading comments...