Testing:
- Global Anova test and set up
- t-test for individual slope(
) - Generalized Linear / Partial F test
Gloabal:
- “Fit the full model” to the data:
- Obtain the least squares estimates of
and . - Determine the error sum of squares, which we denote “SSE
- Obtain the least squares estimates of
- “Fit the reduced model” to the data.:
- Obtain the least squares estimate of
. - Determine the error sum of squares, which we denote “SSE(0).”
- Obtain the least squares estimate of
or
note: at least one predictor is a significant predictor of Y
note:
Anova Table
Reject
Visual Compare reduced model and Full model
Full model:
Reduced model:

note:
SSE-Reduced > SSE-FULL, Adding predictors to a linear regression model will always reduces SSE. RMSE always selects the model with most predictors
Lower values of RMSE indicate a better fit of the model to the data, while higher values indicate a poorer fit. However, by itself, RMSE doesn’t tell you whether the model’s predictions are biased. It merely indicates the magnitude of the error.
T test for individual coefficient( ):
In SLR
Testing for
Test set up:
Rejection Rule :
Reject
Global T test:
In MLR, t test for
In matrix form:
Reject Ho if
If rejected,
ANOVA Test
sequential sum of squares
- sequential sum of squares(Type=1):
- definition:
- It is the reduction in the error sum of squares (SSE) when one or more predictor variables are added to the model.
- Or, it is the increase in the regression sum of squares (SSR) when one or more predictor variables are added to the model.
- note: Sequential sum of squares (Type I) breaks down the variability explained by each predictor as they’re added to the model. For each predictor, we assess how much additional variance it explains beyond preceding predictors. However, in cases of multicollinearity, the order of adding predictors can significantly influence the results.
- The sequential sum of squares measures either the reduction in the error sum of squares (SSE) when adding predictors or the increase in the regression sum of squares (SSR) due to the same addition.
- Theory model:
- definition:
For Test
- in this example,
denote to the error sum of squares when is the only predictor in the model, denotes the error sum of squares when and are both in the model - The rejection of indicate the significance of
given the predictors before it already in the model
note:
- all ss are collated in the above way, therefore, changing the order of the predictors in the model will CHANGE THE RESULT!
-
\begin{array}{ll} y \sim x_1+x_2+x_3 & s s\left(x_2\right)=s s\left(x_2 \mid x_1\right) \ y \sim x_3+x_2+x_1 & \text { ss }\left(x_2\right)=s s\left(x_2 \mid x_3\right) \ y \sim x_1+x_3+x_2 & s s\left(x_2\right)=s s\left(x_2 \mid x_1, x_3\right) \end{array}
Summary:
- t-test for Coefficients
- Test if each coefficient
is significantly different from zero. - The coefficients:
with variance
-statistic formula for :
where
- Hypothesis:
- Decision Rule: Reject
if
or if
- Partial F-test(Type II)
- Compares the fit of the chosen model to a null model.
- Hypothesis:
(Null Model) (Chosen Model)- F-statistic formula:
- Decision Rule: Reject
if
or if
- Note: Rejecting
indicates the chosen model is significantly better than the null model. However, it does not identify which predictors are significant.
- Type III (Marginal F-test):
- Analysis of Variance (ANOVA) in Regression
- Type I (Sequential F-test): Tests the significance of predictors in the order they are entered into the model.
- Type II (Partial F-test): Tests each predictor’s significance after controlling for other predictors.
- Type III: Marginal F test, tests the significance of each predictor by considering all possible orders of entry of the predictors into the model. This test is particularly useful when there are interactions and higher-order terms in the model, as it offers an unbiased test for each term.
reference: Class notes https://online.stat.psu.edu/stat501/lesson/6/6.3#:~:text=What%20is%20a%20%22sequential%20sum,are%20added%20to%20the%20model. https://stats.stackexchange.com/questions/20452/how-to-interpret-type-i-type-ii-and-type-iii-anova-and-manova