
Ultimate access to all questions.
In your role as a Databricks Certified Data Engineer - Professional, you are responsible for optimizing the cost of a Databricks environment that supports a variety of workloads, including data processing, machine learning, and ad-hoc analysis. The environment must remain scalable and performant while reducing costs. You need to analyze usage patterns, identify cost-saving opportunities, and implement changes to achieve cost optimization. Which of the following approaches is the BEST to meet these requirements? (Choose one option)
A
Manually review each Databricks cluster and notebook usage, making ad-hoc adjustments to cluster sizes and instance types based on workload requirements without a systematic analysis.
B
Utilize the Databricks Cost Management UI to systematically analyze usage patterns, identify underutilized clusters and idle resources, and optimize cluster configurations and auto-scaling settings based on data-driven insights.
C
Develop and implement a custom monitoring solution to track Databricks clusters and notebooks usage, then use this data to manually adjust cluster sizing, instance types, and auto-scaling settings.
D
Adopt a third-party cost optimization tool to analyze usage patterns and identify savings, relying solely on the tool's recommendations for implementing changes without further validation.