
Answer-first summary for fast verification
Answer: The model will not be able to find a unique best-fit solution
## Explanation The correct answer is **B. The model will not be able to find a unique best-fit solution**. This issue arises due to a phenomenon known as the **dummy variable trap**. The dummy variable trap occurs when: 1. **Perfect Multicollinearity**: Including all dummy variables for a categorical feature plus an intercept term creates perfect multicollinearity 2. **Linear Dependence**: The sum of all dummy variables equals 1 (for each observation), which is exactly the same as the intercept term 3. **Matrix Inversion Issues**: This makes the design matrix singular and non-invertible ### Why This Happens: - If you have a categorical variable with k categories, you should only create k-1 dummy variables - Including all k dummy variables plus an intercept creates perfect multicollinearity - The statistical software cannot compute unique parameter estimates ### Consequences: - The model cannot find a unique solution for the coefficients - Standard errors become infinite - Parameter estimates become unstable and unreliable ### Solution: - Drop one dummy variable category (the reference category) - Or remove the intercept term (less common) This issue is particularly important in financial risk modeling where accurate parameter estimation is crucial for default prediction.
Author: Tanishq Prabhu
Ultimate access to all questions.
No comments yet.
A financial institution is building a machine learning model to predict the likelihood of default for a portfolio of loans. The model includes several categorical variables, such as loan purpose and borrower credit score, which are transformed into dummy variables. If an intercept term and correlated dummy variables are included in the model, which of the following is a potential issue that may arise?
A
The model will have a single solution
B
The model will not be able to find a unique best-fit solution
C
The model will have a high bias
D
The model will have a high variance