
Answer-first summary for fast verification
Answer: Implementing automated data backup and disaster recovery processes, continuously monitoring pipeline performance with alerts for anomalies, and enforcing strict data access controls based on roles and policies.
The best approach to ensure high data availability and minimal downtime while considering cost efficiency, compliance, and scalability involves: 1) Implementing automated data backup and disaster recovery processes to protect against data loss and ensure quick recovery, which is cost-efficient and scalable. 2) Continuously monitoring pipeline performance with alerts for anomalies to proactively address issues, reducing downtime. 3) Enforcing strict data access controls based on roles and policies to comply with data protection regulations and ensure data integrity. Options B, C, and D fail to meet one or more of the key requirements of cost efficiency, compliance, or scalability.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
As a Databricks Certified Data Engineer - Associate, you are tasked with designing a data governance strategy for an Azure Databricks environment to ensure high data availability and minimal downtime in a data pipeline. The strategy must consider cost efficiency, compliance with data protection regulations, and scalability to handle increasing data volumes. Which of the following approaches BEST meets these requirements? (Choose one option.)
A
Implementing automated data backup and disaster recovery processes, continuously monitoring pipeline performance with alerts for anomalies, and enforcing strict data access controls based on roles and policies.
B
Relying solely on manual backups without a disaster recovery plan, monitoring pipeline performance only during business hours, and applying minimal data access controls.
C
Implementing data backup without encryption, ignoring pipeline performance metrics, and enforcing data access controls without role-based assignments.
D
Disabling data backup processes to save costs, monitoring pipeline performance without setting up alerts, and allowing open access to all data for ease of use.