
Ultimate access to all questions.
In the context of developing a Spark ML Pipeline, explain the process of identifying key gotchas and how to handle them. Provide a code snippet demonstrating the development of a Spark ML Pipeline and explain how to address the potential issues that may arise.
A
Use the Pipeline class from the pyspark.ml module to define the stages of the pipeline, and ensure that each stage is a Spark ML estimator or transformer.
B
Use the PipelineModel class from the pyspark.ml module to define the stages of the pipeline, and ensure that each stage is a Spark ML estimator or transformer.
C
Use the Pipeline class from the pyspark.ml module to define the stages of the pipeline, but be aware of the potential for data leakage between stages.
D
Use the PipelineModel class from the pyspark.ml module to define the stages of the pipeline, but be aware of the potential for data leakage between stages.