
Answer-first summary for fast verification
Answer: Build the environment in Azure Databricks and use Azure Data Factory for orchestration.
The correct answer is B because Azure Databricks provides a unified platform for both data engineering and data science, supporting Python and Scala, workload isolation, interactive workloads, and cluster scaling. Azure Data Factory is the optimal orchestration tool as it specializes in composing data pipelines across various services, unlike Azure Container Instances (ACI), which lacks native orchestration capabilities for data workflows. The community consensus strongly supports B, citing ADF's superior orchestration features, integration with diverse data sources, and visual workflow designer, while noting ACI's limitations in autoscaling and orchestration for data pipelines.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
Your team is building a data engineering and data science development environment that must meet the following requirements:
What should you do?
A
Build the environment in Apache Hive for HDInsight and use Azure Data Factory for orchestration.
B
Build the environment in Azure Databricks and use Azure Data Factory for orchestration.
C
Build the environment in Apache Spark for HDInsight and use Azure Container Instances for orchestration.
D
Build the environment in Azure Databricks and use Azure Container Instances for orchestration.
No comments yet.