Ultimate access to all questions.
In the context of Spark Structured Streaming, consider a scenario where you are processing real-time sales data from multiple sources. The data includes timestamps, and due to network latency, some data arrives late. Your application must accurately reflect sales figures, including late-arriving data, up to 2 hours after the expected time. Additionally, the solution must minimize resource usage by efficiently managing state. Which of the following approaches BEST meets these requirements? Choose the correct option from the four provided.