
Answer-first summary for fast verification
Answer: In environments where interactive code will be executed, production data should only be accessible with read permissions; creating isolated databases for each environment further reduces risks.
The best practice in this scenario is to ensure production data is accessed with read-only permissions in non-production environments and to isolate databases per environment to minimize risks. Option B correctly advocates for read access to production data and isolated databases, reducing the chance of accidental writes or data corruption. Other options are flawed: A discourages environment separation (risky), C relies on code declarations (error-prone), D overestimates Delta Lake's protections, and E assumes passthrough credentials negate admin risks (insecure).
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
A data engineering team is setting up development, testing, and production environments for a new data pipeline migration. They need thorough testing of both code and resulting data, aiming to use data closely resembling production.
A junior engineer proposes mounting production data directly to development and testing environments, allowing pre-production code to run against real data. Since all users have admin access in the development environment, the junior engineer volunteers to configure permissions and mount the data.
Which statement reflects the most appropriate best practice in this scenario?
A
All development, testing, and production code and data should exist in a single, unified workspace; creating separate environments for testing and development complicates administrative overhead.
B
In environments where interactive code will be executed, production data should only be accessible with read permissions; creating isolated databases for each environment further reduces risks.
C
As long as code in the development environment declares USE dev_db at the top of each notebook, there is no possibility of inadvertently committing changes back to production data sources.
D
Because Delta Lake versions all data and supports time travel, it is not possible for user error or malicious actors to permanently delete production data; as such, it is generally safe to mount production data anywhere.
E
Because access to production data will always be verified using passthrough credentials, it is safe to mount data to any Databricks development environment.