
Answer-first summary for fast verification
Answer: Use an Event-based trigger to start Spark jobs and configure the Databricks cluster to autoscale based on current workload.
An Event-based trigger launches the pipeline as soon as new data arrives (for example, when a file lands in storage), so scheduling adapts to dynamic data inputs instead of a fixed clock. Enabling autoscaling on the Databricks cluster lets it add or remove workers as data volume changes, keeping resource utilization and performance efficient without manual intervention.
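As a sketch of the trigger half of this answer, an Azure Data Factory storage-event trigger can fire whenever a new blob is created under a given path. The names, paths, and placeholder IDs below are illustrative, not values from the question:

```json
{
  "name": "NewDataFileTrigger",
  "properties": {
    "type": "BlobEventsTrigger",
    "typeProperties": {
      "blobPathBeginsWith": "/input-container/blobs/",
      "ignoreEmptyBlobs": true,
      "events": [ "Microsoft.Storage.BlobCreated" ],
      "scope": "/subscriptions/<subscription-id>/resourceGroups/<resource-group>/providers/Microsoft.Storage/storageAccounts/<storage-account>"
    },
    "pipelines": [
      {
        "pipelineReference": {
          "referenceName": "RunSparkJobPipeline",
          "type": "PipelineReference"
        }
      }
    ]
  }
}
```

Here the trigger invokes a pipeline (assumed to contain a Databricks activity) each time a file is uploaded, which is what makes the scheduling responsive to dynamic data inputs rather than a fixed timetable.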
Author: LeetQuiz Editorial Team
Design a data pipeline in Azure Data Factory that schedules the execution of Spark jobs on an Azure Databricks cluster. The pipeline needs to handle dynamic data inputs and adjust the cluster size based on the volume of data. Describe how you would manage the scheduling of these Spark jobs and the considerations for scaling the Databricks cluster.
A
Schedule Spark jobs using a fixed-time schedule trigger and use a static-sized Databricks cluster.
B
Use an Event-based trigger to start Spark jobs and configure the Databricks cluster to autoscale based on current workload.
C
Manually trigger Spark jobs and manually adjust the Databricks cluster size as needed.
D
Schedule Spark jobs to run at off-peak hours to avoid scaling issues.
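For the scaling half of the correct answer (B), the Databricks job cluster used by the pipeline can declare an autoscale range instead of a fixed worker count, so Databricks sizes the cluster to the workload. A minimal sketch of such a cluster specification, with an illustrative runtime version, node type, and worker range:

```json
{
  "new_cluster": {
    "spark_version": "13.3.x-scala2.12",
    "node_type_id": "Standard_DS3_v2",
    "autoscale": {
      "min_workers": 2,
      "max_workers": 8
    }
  }
}
```

With this configuration, small inputs run on the minimum number of workers while large inputs scale the cluster up toward the maximum, which is why option B handles varying data volumes where the static cluster in option A cannot.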