
You are designing a stream processing pipeline that must efficiently process related data within a single partition. Describe how you would configure the pipeline to achieve this, including the partitioning strategy you would use and the mechanisms you would put in place to ensure data locality and minimize data movement.
A
Use a round-robin partitioning strategy and rely on the system's built-in data locality mechanisms.
B
Implement a key-based partitioning strategy and use local caching to minimize data movement.
C
Use a random partitioning strategy and manually manage data locality by assigning partitions to specific nodes.
D
Rely on the default partitioning strategy provided by the stream processing framework and optimize for throughput.
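
As a rough illustration of how the partitioning strategies named in the options differ, the sketch below compares a round-robin partitioner with a key-based one on a handful of records, then keeps a per-partition in-memory dictionary as a stand-in for a local cache. It is not tied to any particular framework: the partition count, the record keys, and the function names (`key_partition`, `make_round_robin`, `partitions_per_key`) are all hypothetical and chosen only for this example.

```python
import zlib
from collections import defaultdict
from itertools import count

NUM_PARTITIONS = 4  # hypothetical partition count, for illustration only


def key_partition(key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Key-based: a stable hash, so equal keys always map to the same partition."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def make_round_robin(num_partitions: int = NUM_PARTITIONS):
    """Round-robin: ignores the key and cycles through the partitions in order."""
    counter = count()

    def partition(_key: str) -> int:
        return next(counter) % num_partitions

    return partition


def partitions_per_key(records, partitioner):
    """Return, for each key, the set of partitions its records were sent to."""
    seen = defaultdict(set)
    for key, _value in records:
        seen[key].add(partitioner(key))
    return dict(seen)


# Hypothetical (key, value) records.
records = [("user-1", 10), ("user-2", 5), ("user-1", 7), ("user-3", 1), ("user-1", 2)]

key_based = partitions_per_key(records, key_partition)
round_robin = partitions_per_key(records, make_round_robin())

# Per-partition local state (a stand-in for a local cache): with key-based
# partitioning, a key's running total lives in exactly one partition.
local_state = [defaultdict(int) for _ in range(NUM_PARTITIONS)]
for key, value in records:
    local_state[key_partition(key)][key] += value

print("key-based:  ", key_based)
print("round-robin:", round_robin)
```

In this sketch every key lands in exactly one partition under the key-based partitioner, so per-key state can be kept locally, whereas the round-robin partitioner spreads the records for `user-1` across more than one partition, which would require moving data to combine them.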