
Answer-first summary for fast verification
Answer: Apply schema-on-read during data loading to enforce data types and nullability constraints.
Applying schema-on-read during data loading with Apache Spark — that is, supplying an explicit schema when reading the raw files — is the most efficient and scalable approach to ensuring high data quality. It enforces data types and declares nullability expectations at the point of ingestion, so type mismatches and missing values are caught early rather than surfacing downstream. Because validation happens inside Spark's distributed read path, it scales to large data volumes without a separate cleansing pass, yielding consistent and reliable data for downstream analysis. Compared with manually inspecting samples, cleaning data after ingestion, or preprocessing with an external tool, schema-on-read is the more integrated and less time-consuming solution.
Author: LeetQuiz Editorial Team
When developing a data ingestion pipeline that consolidates data from various sources using Apache Spark, which method would you employ to ensure high data quality across ingested datasets?
A
Utilize Spark's built-in data frame functions to clean and validate data after ingestion.
B
Manually inspect a sample of the ingested data for quality issues.
C
Apply schema-on-read during data loading to enforce data types and nullability constraints.
D
Implement an external data quality tool to preprocess files before ingestion.