This is a tricky question that needs application of the omitted variables formula. Recall that if the regression model is stated as:
Yi=α+β1X1i+β2X2i+ei
If we omit X2 from the estimated model, then the model is given by:
Yi=α+β1X1i+ei
Now, in large sample sizes, the OLS estimator β^1 converges to:
β1+β2δ1
Where:
δ1=Var(X1)Cov(X1,X2)
Maintaining this line of thought, the first regression equation suggests that:
β1+β2δ1=0.5633
And the second equation suggests that:
β2+β1δ2=−0.7633
Where δ2=Var(X2)Cov(X1,X2)
Given:
- Cov(X1,X2)=0.603
- Var(X1)=0.874
- Var(X2)=0.75
We can calculate:
δ1=0.8740.603=0.6899
δ2=0.750.603=0.804
Now we have two equations:
- β1+0.6899β2=0.5633‘2
. 0.80‘4β1+β2=−0.7633
Solving these simultaneous equations:
From equation 1: β1=0.5633−0.6899β2
Substitute into equation 2:
$0.804(0.5633 - 0.6899\beta_2) + \beta_2 = -0.7633‘0.4529 - 0.5547\beta_2 + \beta_2 = -0.7633$ 0.45‘29+0.4453β2=−0.7633
$0.4453\beta_2 = -1.2162\beta_2 = -2.7312$
Then β1=0.5633−0.6899(−2.7312)=0.5633+1.884=2.4473
Now for the intercept, we need to consider that the intercept from the first regression (0.5767) is actually α+β2×mean(X2) when X1 is omitted, but since we don't have means, we can use the fact that the intercept should satisfy:
From the first regression: Y^=0.5767+0.5633X1
From the second regression: Y^=0.6767−0.7633X2
When we have both variables, the intercept α^ should be such that:
Y^=α^+β1X1+β2X2
Given the calculated coefficients β1=2.4473 and β2=−2.7312, and matching the intercept from either equation, we get the expression in option D: α^=Y^i−2.4476X1i+2.7312X2i (note the sign change for β2 since it's negative).