Ultimate access to all questions.
Consider a scenario where you need to process a large volume of semi-structured data using Apache Spark in a cloud environment. The data includes log files from multiple servers and needs to be transformed into a structured format for analysis. Describe the steps you would take to achieve this, including how you would optimize the Spark jobs for performance and cost efficiency.