In a Spark MLlib project, you are tasked with comparing the performance of linear regression and decision tree models on a large dataset. Which of the following evaluation metrics would be most appropriate for comparing the performance of these models, and why?