Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
In the context of real-time data ingestion into a lakehouse from multiple sources where conflicts are likely, which technique ensures data accuracy and consistency without significantly degrading performance?
A
Stage incoming data in a raw format, employing periodic batch reconciliation jobs to resolve conflicts and merge data.
B
Utilize streaming stateful operations to detect and resolve conflicts based on predefined business rules.
C
Apply a last-write-wins policy, using event timestamps to resolve conflicts during ingestion.
D
Implement a two-phase commit protocol across the ingestion pipeline to ensure atomicity and consistency of writes.