In the context of building a scalable and cost-effective machine learning pipeline for a multinational corporation, which of the following scenarios best illustrates the advantage of using a data lake for storing large datasets? Choose the two most correct options.