You are overseeing your company's data lake on BigQuery, with data ingestion pipelines pulling data from Pub/Sub into BigQuery tables. After a new pipeline version was deployed, there was a 50% increase in daily stored data, with some tables' daily partition sizes doubling, despite no change in Pub/Sub data volumes. What steps should you take to investigate and resolve this sudden data increase?
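
One way to start the investigation is sketched below. It assumes the growth comes from duplicate rows, which is only one hypothesis consistent with larger partitions and unchanged Pub/Sub volume (for example, the old and new pipeline versions both writing). The helpers only build SQL text: the first targets BigQuery's INFORMATION_SCHEMA.PARTITIONS view to compare partition sizes before and after the deployment, and the second counts repeated IDs within one day. The project, dataset, table, and column names are hypothetical placeholders, not taken from the question.

```python
def partition_size_query(project: str, dataset: str, table: str) -> str:
    """Build SQL listing row and byte counts per partition for one table.

    Reads the INFORMATION_SCHEMA.PARTITIONS view; comparing partitions from
    before and after the deployment shows when the growth started.
    """
    return (
        "SELECT partition_id, total_rows, total_logical_bytes\n"
        f"FROM `{project}.{dataset}.INFORMATION_SCHEMA.PARTITIONS`\n"
        f"WHERE table_name = '{table}'\n"
        "ORDER BY partition_id DESC"
    )


def duplicate_check_query(
    project: str,
    dataset: str,
    table: str,
    id_column: str,    # hypothetical: a column carrying a unique message ID
    date_column: str,  # hypothetical: the DATE column the table is partitioned on
    day: str,          # partition to inspect, formatted YYYY-MM-DD
) -> str:
    """Build SQL that lists IDs appearing more than once in a single day."""
    return (
        f"SELECT {id_column}, COUNT(*) AS copies\n"
        f"FROM `{project}.{dataset}.{table}`\n"
        f"WHERE {date_column} = DATE '{day}'\n"
        f"GROUP BY {id_column}\n"
        "HAVING COUNT(*) > 1\n"
        "ORDER BY copies DESC"
    )


if __name__ == "__main__":
    # Placeholder identifiers; substitute real ones before running the SQL.
    # Values are interpolated directly, so pass trusted identifiers only.
    print(partition_size_query("my-project", "lake", "events"))
    print()
    print(duplicate_check_query(
        "my-project", "lake", "events", "message_id", "event_date", "2024-01-15"
    ))
```

If the second query returns rows, the doubled partitions are probably duplicates, and the next check would be whether more than one pipeline version is still running and writing to the same tables. If it returns nothing, the growth has another cause, such as a schema or payload change in the new version.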