Reddit

Given a DataFrame `df` with columns `id`, `name`, and `timestamp`, how would you create a new DataFrame that removes duplicate rows based on the `id` column? Provide the Spark code to achieve this. | Databricks Certified Data Engineer - Associate Quiz - LeetQuiz