
Answer-first summary for fast verification
Answer: Generate a new branch in Databricks, commit the changes there, and then push these modifications to the associated Git provider.
The most efficient approach for the data scientist to implement changes without disrupting the project's daily operations would be to establish a new branch within Databricks, commit the modifications to this branch, and then push these updates to the associated Git provider. This methodology permits the review and testing of changes before they are merged with the main codebase, ensuring the project's daily routine remains unaffected until the changes are confirmed and incorporated.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
A novice data scientist has recently joined an ongoing machine learning project that operates as a daily retraining scheduled job, housed in a Databricks Repository. Their task is to enhance the feature engineering of the pipeline's preprocessing phase without disrupting the daily operations. What is the best strategy for the data scientist to adopt?
A
Duplicate the project's notebooks into a Databricks Workspace folder and implement the necessary adjustments there.
B
Clone the project's notebooks into a separate Databricks Repository and implement the required alterations there.
C
Temporarily halt the project's automatic daily operations and modify the existing code in its original location.
D
Generate a new branch in Databricks, commit the changes there, and then push these modifications to the associated Git provider.
E
Initiate a new Git repository, integrate this into Databricks, and transfer the original code from the current repository to this new one before making modifications.
No comments yet.