Ultimate access to all questions.
Discuss the challenges posed by 'smalls' (tiny files, scanning overhead, over partitioning) in a data processing environment. Provide a detailed analysis of how these challenges impact query performance and propose a solution involving the use of CDF and optimized partitioning to address these challenges.