
Ultimate access to all questions.
In a high-traffic data engineering environment, your team is managing a Delta Lake table that contains millions of rows and is frequently accessed by multiple concurrent users for both read and write operations. The organization prioritizes data consistency, performance, and scalability. Given these requirements, which of the following strategies would BEST ensure data consistency and prevent conflicts while maintaining high availability and performance? Choose the single best option.
A
Restrict the Delta table to read-only operations during peak hours to avoid conflicts, scheduling write operations during off-peak times.
B
Leverage Delta Lake's Optimistic Concurrency Control feature to allow concurrent writes, automatically detecting and resolving conflicts to ensure data consistency without sacrificing performance.
C
Migrate the data to a distributed NoSQL database that partitions data across multiple nodes, aiming to distribute the load and reduce conflict potential.
D
Develop a custom application layer that manually checks for data conflicts before any write operation, requiring developers to implement conflict resolution logic for each transaction.