
Ultimate access to all questions.
In the context of designing a scalable and efficient data pipeline for a global e-commerce platform, which of the following strategies is the MOST effective to ensure the pipeline operates within acceptable performance thresholds under varying load conditions? Choose the best option.
A
Implementing a smaller subset of the production data for testing purposes to simulate load conditions.
B
Continuously monitoring pipeline metrics such as latency, throughput, and error rates, and configuring alerts for when these metrics exceed predefined thresholds.
C
Dynamically adjusting the number of processing nodes based on current load to optimize resource utilization and cost.
D
Eliminating all logging and monitoring features to reduce overhead and improve processing speed.
E
Both B and C are effective strategies for maintaining pipeline performance under varying load conditions.