Ultimate access to all questions.
In the context of managing large datasets in Delta Lake on Azure Databricks, consider a scenario where a data engineer needs to archive historical data older than 3 years to comply with data retention policies while ensuring minimal impact on query performance for current data. The dataset is partitioned by year and month. Which of the following approaches BEST leverages Delta Lake's partitioning feature to achieve this goal efficiently? Choose the correct option and explain why it is the best choice.