Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.

Explanation:

To control the amount of data processed in each micro-batch, the following input limit options can be used: maxFilesPerTrigger and maxBytesPerTrigger. It's important to note that if both are used in a single query, the data processed in each micro-batch will be the lesser of maxFilesPerTrigger and maxBytesPerTrigger. Whichever limit is reached first, the data processing will be stopped for that micro-batch of the stream. For more information, refer to the documentation on limiting input rate for streaming reads.

Explanation:

Comments (0)

No comments yet.

An upstream system sends more than 1000 files every hour, each containing between 5000 to 30000 records, with file sizes ranging from 580MB to 1.22 GB. The data is ingested using a medallion architecture. To limit the amount of data processed in each micro-batch, which of the following options is valid for controlling the input rate while reading the stream?

Real Exam

maxFilesPerTrigger and maxRowsPerTrigger

10.9%

maxBytesPerTrigger and maxRecordsPerTrigger

12.4%

maxBytesPerTrigger and maxFilesPerTrigger

55.5%

maxFilesPerTrigger and maxRecordsPerTrigger

10.9%

maxFilesPerTrigger, maxBytesPerTrigger, maxRowsPerTrigger, and maxRecordsPerTrigger

10.2%