Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.

In a scenario where you are working with multiple notebooks in a Databricks workspace that depend on a common Python library, and you need to ensure that the solution is scalable, maintainable, and adheres to best practices for dependency management, which of the following approaches would you choose? Consider the need for proper package structure, ease of updates, and minimal manual intervention. Choose the best option from the following:

Simulated

Upload the Python files directly to the Databricks workspace as a library without any package structure, and reference them in each notebook.

7.5%

Use the Databricks CLI to create a library with a Python package, ensuring the inclusion of __init__.py files for proper package structure, and reference this package in the notebooks.

Comments

Loading comments...

Embed the Python code directly into each notebook using %pip magic commands to install the required packages at runtime.

16.1%