
Explanation:
In Azure Stream Analytics, the watermark delay metric represents the time difference between the current processing time and the latest event timestamp that has been processed. When the average watermark delay consistently exceeds the configured late arrival tolerance, this indicates that the job is falling behind in processing events in a timely manner.
D: The job lacks the resources to process the volume of incoming data - This is the most likely cause because:
A: Events whose application timestamp is earlier than their arrival time by more than five minutes arrive as inputs - While late-arriving events can contribute to delays, the late arrival tolerance is specifically designed to handle these scenarios. Events arriving within the tolerance window are processed normally, and those outside it are dropped or handled according to policy.
B: There are errors in the input data - Data errors typically result in failed events or output errors, not necessarily sustained watermark delays. While data quality issues can impact processing, they don't consistently cause watermark delays exceeding late arrival tolerance.
C: The late arrival policy causes events to be dropped - The late arrival policy dropping events is actually a consequence of the delay problem, not the cause. When events arrive too late (beyond the tolerance), they get dropped, but this doesn't explain why the watermark delay itself is consistently high.
A consistently high watermark delay exceeding the late arrival tolerance primarily indicates throughput and resource capacity issues rather than data timing or policy configuration problems. The solution typically involves scaling the job resources to match the input data volume.
Ultimate access to all questions.
No comments yet.
You are monitoring an Azure Stream Analytics job using Azure Metrics. You observe that over the past 12 hours, the average watermark delay has been consistently higher than the configured late arrival tolerance. What is a potential cause for this behavior?
A
Events whose application timestamp is earlier than their arrival time by more than five minutes arrive as inputs.
B
There are errors in the input data.
C
The late arrival policy causes events to be dropped.
D
The job lacks the resources to process the volume of incoming data.