
Answer-first summary for fast verification
Answer: The job lacks the resources to process the volume of incoming data.
## Explanation In Azure Stream Analytics, the **watermark delay** metric represents the time difference between the current processing time and the latest event timestamp that has been processed. When the average watermark delay consistently exceeds the configured late arrival tolerance, this indicates that the job is falling behind in processing events in a timely manner. ### Why Option D is Correct **D: The job lacks the resources to process the volume of incoming data** - This is the most likely cause because: - **Resource Constraints**: If the streaming job doesn't have sufficient Streaming Units (SUs) or computing resources to handle the incoming data rate, events will queue up and processing will lag behind. - **Bottleneck Scenario**: When input data volume exceeds the job's processing capacity, the system cannot keep up with real-time processing, causing the watermark delay to increase beyond acceptable thresholds. - **Scalability Issue**: This scenario often requires scaling up the job by increasing the number of Streaming Units to match the input data volume. ### Why Other Options Are Less Suitable **A: Events whose application timestamp is earlier than their arrival time by more than five minutes arrive as inputs** - While late-arriving events can contribute to delays, the late arrival tolerance is specifically designed to handle these scenarios. Events arriving within the tolerance window are processed normally, and those outside it are dropped or handled according to policy. **B: There are errors in the input data** - Data errors typically result in failed events or output errors, not necessarily sustained watermark delays. While data quality issues can impact processing, they don't consistently cause watermark delays exceeding late arrival tolerance. **C: The late arrival policy causes events to be dropped** - The late arrival policy dropping events is actually a *consequence* of the delay problem, not the cause. When events arrive too late (beyond the tolerance), they get dropped, but this doesn't explain why the watermark delay itself is consistently high. ### Key Takeaway A consistently high watermark delay exceeding the late arrival tolerance primarily indicates **throughput and resource capacity issues** rather than data timing or policy configuration problems. The solution typically involves scaling the job resources to match the input data volume.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are monitoring an Azure Stream Analytics job using Azure Metrics. You observe that over the past 12 hours, the average watermark delay has been consistently higher than the configured late arrival tolerance. What is a potential cause for this behavior?
A
Events whose application timestamp is earlier than their arrival time by more than five minutes arrive as inputs.
B
There are errors in the input data.
C
The late arrival policy causes events to be dropped.
D
The job lacks the resources to process the volume of incoming data.
No comments yet.