You are working on a data transformation project using Apache Spark. You have a large dataset with multiple columns, and you need to perform a series of transformations on the data. Which of the following transformation techniques would be most suitable for this scenario?