You are tasked with designing a data pipeline that needs to handle a large volume of data in real-time. Which distributed computing framework would be most suitable for this task, and why? | AWS Certified Data Engineer - Associate Quiz - LeetQuiz