
A data engineer needs to determine whether to use the built-in Databricks Notebooks versioning or version their project using Databricks Repos. Which of the following is an advantage of using Databricks Repos over the Databricks Notebooks versioning?
A: Databricks Repos automatically saves development progress
B: Databricks Repos supports the use of multiple branches
C: Databricks Repos allows users to revert to previous versions of a notebook
D: Databricks Repos provides the ability to comment on specific changes
E: Databricks Repos is wholly housed within the Databricks Lakehouse Platform
Explanation:
Correct Answer: B - Databricks Repos supports the use of multiple branches
Why this is correct:
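Multiple Branch Support: Databricks Repos integrates with Git repositories, which inherently support branching. This allows data engineers to develop changes on separate branches, merge them, and collaborate through an external Git repository.
Comparison with Built-in Notebooks Versioning: Built-in versioning keeps a single, linear revision history for each notebook and has no concept of branches.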
Why other options are incorrect:
A: Both Databricks Notebooks versioning and Databricks Repos automatically save development progress. This is not a distinguishing advantage.
C: Both systems allow users to revert to previous versions of a notebook. Built-in versioning provides this functionality through the notebook's revision history.
D: Commenting on specific changes is typically a feature of Git platforms (like GitHub, GitLab, Bitbucket) rather than Databricks Repos itself. Databricks Repos provides Git integration but doesn't inherently provide code review/commenting features within the Databricks interface.
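E: Databricks Repos syncs with an external Git provider (such as GitHub, GitLab, or Bitbucket), so it is not wholly housed within the Databricks Lakehouse Platform. Built-in notebook versioning is the option that lives entirely inside the platform, so this is not an advantage of Repos.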
Key Takeaway: The primary advantage of Databricks Repos over built-in notebook versioning is its Git integration, which enables proper version control workflows including branching, merging, and collaboration through external Git repositories.
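Example: To make the branching point concrete, here is a minimal Python sketch of switching a repo to another branch programmatically. It assumes the workspace exposes the Repos REST API at /api/2.0/repos/{repo_id} and accepts a PATCH request with a "branch" field. The host, token, repo ID, and branch name are hypothetical placeholders, and the helper name build_checkout_request is made up for illustration. The request is only built, not sent.

```python
import json
import urllib.request


def build_checkout_request(
    host: str, token: str, repo_id: int, branch: str
) -> urllib.request.Request:
    """Build (but do not send) a request that checks out `branch` in a repo.

    Assumption: the workspace supports PATCH /api/2.0/repos/{repo_id}
    with a JSON body of the form {"branch": "<name>"}.
    """
    url = f"{host.rstrip('/')}/api/2.0/repos/{repo_id}"
    body = json.dumps({"branch": branch}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="PATCH",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Hypothetical values, for illustration only.
req = build_checkout_request(
    host="https://example.cloud.databricks.com",
    token="dapi-EXAMPLE",
    repo_id=123,
    branch="feature/new-pipeline",
)

# Inspect the request instead of sending it.
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing the request to urllib.request.urlopen would send it to a real workspace. Built-in notebook versioning has no equivalent operation, because a notebook's revision history is a single linear sequence with no branches to switch between.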