
Answer-first summary for fast verification
Answer: Use the cluster's library management feature to install the libraries globally, and schedule regular updates for these libraries.
Option B is the most efficient approach for ensuring that all notebooks have access to the required libraries without the need for manual installation by each user. By using the cluster's library management feature, the libraries can be installed globally, and regular updates can be scheduled to keep them up-to-date, ensuring that the team has access to the latest features and bug fixes.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are responsible for setting up a Databricks cluster for a team of data scientists who will be working on a variety of machine learning projects. The team has requested that certain Python libraries, including scikit-learn and pandas, be pre-installed on the cluster. Describe the process you would follow to ensure that these libraries are available to all notebooks on the cluster, and explain how you would handle updates to these libraries in the future.
A
Install the libraries manually in each notebook session as they are needed.
B
Use the cluster's library management feature to install the libraries globally, and schedule regular updates for these libraries.
C
Ask each data scientist to install the libraries they need in their individual notebook sessions.
D
Create a script that installs the libraries in each notebook session automatically.
No comments yet.