
Answer-first summary for fast verification
Answer: Create a patient pipeline that contains the four pipelines and use a schedule trigger.
## Analysis of the Pipeline Scheduling Scenario ### Requirements Summary: - **Four pipelines**: Ingest Data from System1, Ingest Data from System2, Populate Dimensions, Populate Facts - **Dependencies**: - Populate Dimensions depends on completion of both Ingest Data from System1 and Ingest Data from System2 - Populate Facts depends on completion of Populate Dimensions - Ingest Data from System1 and Ingest Data from System2 have no dependencies on each other - **Execution frequency**: All pipelines must run every 8 hours ### Evaluation of Options: **Option A: Add an event trigger to all four pipelines** - ❌ **Not suitable**: Event triggers respond to external events (e.g., file arrival, blob creation) rather than time-based scheduling - ❌ **Problem**: Would not ensure the required 8-hour execution cadence - ❌ **Issue**: No mechanism to enforce the dependency chain between pipelines **Option B: Add a schedule trigger to all four pipelines** - ❌ **Not suitable**: While schedule triggers can handle the 8-hour frequency, they cannot enforce the complex dependency requirements - ❌ **Problem**: All pipelines would run independently at the same time, violating the dependency constraints - ❌ **Issue**: Populate Dimensions might start before the ingestion pipelines complete **Option C: Create a parent pipeline that contains the four pipelines and use a schedule trigger** - ✅ **Optimal solution**: This approach properly addresses all requirements - ✅ **Dependency management**: The parent pipeline can use Execute Pipeline activities with proper dependency configuration - ✅ **Scheduling**: A schedule trigger ensures execution every 8 hours - ✅ **Execution order**: Can be configured as: - Execute Ingest Data from System1 and Ingest Data from System2 in parallel - Execute Populate Dimensions only after both ingestion pipelines complete - Execute Populate Facts only after Populate Dimensions completes **Option D: Create a parent pipeline that contains the four pipelines and use an event trigger** - ❌ **Not suitable**: While the parent pipeline structure handles dependencies correctly, event triggers are inappropriate for time-based scheduling - ❌ **Problem**: Event triggers respond to external events, not fixed time intervals - ❌ **Issue**: Cannot guarantee execution every 8 hours as required ### Why Option C is the Best Practice: 1. **Centralized orchestration**: A parent pipeline provides a single point of control for the entire workflow 2. **Dependency enforcement**: Execute Pipeline activities can be configured with proper success dependencies 3. **Scheduling compliance**: Schedule triggers are specifically designed for recurring time-based execution 4. **Maintainability**: Changes to the execution schedule or dependencies can be managed in one location 5. **Monitoring**: Provides a unified view of the entire data processing workflow ### Implementation Approach: The parent pipeline would contain: - Execute Pipeline activities for each of the four pipelines - Dependency configuration ensuring: - Populate Dimensions depends on successful completion of both ingestion pipelines - Populate Facts depends on successful completion of Populate Dimensions - A schedule trigger configured for 8-hour intervals This solution ensures the pipelines execute in the correct order while maintaining the required 8-hour execution frequency.
Ultimate access to all questions.
Author: LeetQuiz Editorial Team
You have the following Azure Data Factory pipelines:
Ingest Data from System1 and Ingest Data from System2 have no dependencies on each other. Populate Dimensions must execute after both Ingest Data from System1 and Ingest Data from System2 complete. Populate Facts must execute after Populate Dimensions completes. All pipelines must execute every eight hours.
What should you do to schedule the pipelines for execution?
A
Add an event trigger to all four pipelines.
B
Add a schedule trigger to all four pipelines.
C
Create a patient pipeline that contains the four pipelines and use a schedule trigger.
D
Create a patient pipeline that contains the four pipelines and use an event trigger.
No comments yet.