
Ultimate access to all questions.
A business reporting team requires their dashboard data to be refreshed once every hour. The ETL pipeline responsible for data extraction, transformation, and loading typically takes 10 minutes to complete.
Under normal conditions, which configuration would best meet this service-level agreement (SLA) while minimizing operational costs?
A
Schedule the pipeline to run every hour on a dedicated, always-on interactive (all-purpose) cluster.
B
Configure a job to trigger automatically whenever new data files arrive in a specific cloud storage directory.
C
Use a Structured Streaming job with a 60-minute trigger interval on a running cluster.
D
Schedule a job to execute the pipeline every hour using a new job cluster.