
Ultimate access to all questions.
You are designing an Azure Databricks table to persist an average of 20 million streaming events daily for use in incremental load pipeline jobs. The solution must minimize both storage costs and incremental load times.
What should you include in the design?
A
Partition by DateTime fields.
B
Sink to Azure Queue storage.
C
Include a watermark column.
D
Use a JSON format for physical data storage.