
In the context of building a scalable and cost-effective machine learning pipeline for a multinational corporation, which of the following scenarios best illustrate the advantage of using a data lake to store large datasets? Choose the two most correct options.
A
The need to process and analyze real-time transactional data from global sales with millisecond latency.
B
The requirement to store petabytes of raw, unstructured social media data for future sentiment analysis projects.
C
The necessity to quickly retrieve small subsets of structured data for daily operational reports.
D
The objective to minimize storage costs while retaining the flexibility to store and analyze both structured and unstructured data over long periods.
E
The strategy to leverage high-speed data retrieval for interactive dashboard visualizations.