
Answer-first summary for fast verification
Answer: Store archived data in a distributed file system and use checksums to verify data integrity before replay.
Option B provides a robust solution for archiving and replaying stream data. A distributed file system offers scalability and durability, which are essential for storing large volumes of archived data. Using checksums to verify data integrity ensures that the data is consistent and accurate before it is replayed, which is crucial for historical analysis.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are designing a stream processing solution that must replay archived stream data for historical analysis. Describe how you would configure the system to enable this functionality, including the storage solution you would use for archiving and the mechanisms you would put in place to ensure data integrity and consistency during replay.
A
Use a relational database for archiving and rely on database transactions to ensure data integrity during replay.
B
Store archived data in a distributed file system and use checksums to verify data integrity before replay.
C
Archive data in a NoSQL database and use snapshotting to ensure consistency during replay.
D
Use a message broker for archiving and rely on its built-in replay functionality to handle data integrity.
No comments yet.