In the context of designing a scalable and efficient data system for a machine learning project, during which phase is data typically collected and acquired from various sources, considering factors such as data volume, velocity, and variety? Choose the best option.