Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
Consider a scenario where you have a large dataset that needs to be processed and analyzed using Pandas-like operations. You are given the option to use either native Spark or Pandas API on Spark. Which option would you choose and why?
A
Use native Spark because it is faster and more efficient for large datasets.
B
Use Pandas API on Spark because it provides a familiar Pandas-like API and requires less refactoring of existing code.
C
Use both native Spark and Pandas API on Spark simultaneously to take advantage of their respective strengths.
D
Use neither native Spark nor Pandas API on Spark, as they are not suitable for large datasets.