Simple and multiple regression analysis in matrix form. Topics: least squares, beta estimation, simple linear regression, multiple regression with two predictors, multiple regression with three predictors, sums of squares, R2, tests on the b parameters, covariance matrix of the b, standard error of the b.
Simple and multiple regression analysis in matrix form • Least squares • Beta estimation • Simple linear regression • Multiple regression with two predictors • Multiple regression with three predictors • Sums of squares • R2 • Tests on the b parameters • Covariance matrix of the b • Standard error of the b
Simple and multiple regression analysis in matrix form • Tests on individual predictors • Variance of individual predictors • Correlation between predictors • Standardized matrices • Correlation matrices • Sums of squares in Z • R2 in Z • R2 between independent variables • Standard error of b in Z
Least squares Starting from the general model: the method of least squares estimates the beta parameters by minimizing the sum of squares due to error. In fact, if:
Least squares You can estimate:
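The minimization described above leads to the familiar normal equations. A minimal sketch of the standard derivation, assuming the model y = Xb + e with X containing a leading column of ones (the symbols e and SS_res are notation chosen here, not taken from the slides):

```latex
\[
\mathbf{y} = \mathbf{X}\mathbf{b} + \mathbf{e}, \qquad
SS_{res} = \mathbf{e}'\mathbf{e}
         = (\mathbf{y} - \mathbf{X}\mathbf{b})'(\mathbf{y} - \mathbf{X}\mathbf{b})
\]
\[
\frac{\partial SS_{res}}{\partial \mathbf{b}}
  = -2\,\mathbf{X}'\mathbf{y} + 2\,\mathbf{X}'\mathbf{X}\mathbf{b} = \mathbf{0}
\;\Longrightarrow\;
\mathbf{X}'\mathbf{X}\mathbf{b} = \mathbf{X}'\mathbf{y}
\;\Longrightarrow\;
\mathbf{b} = (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{y}
\]
```

The last step requires X'X to be invertible, that is, the columns of X must be linearly independent.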
Simple linear regression Intercept and slope
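For the one-predictor case the intercept and slope have simple closed forms: the slope is the sum of cross-products of deviations divided by the sum of squared deviations of x, and the intercept makes the line pass through the point of means. A small sketch in plain Python; the function name `simple_regression` and the data are invented for illustration:

```python
def simple_regression(x, y):
    """Return (intercept, slope) of the least squares line y = b0 + b1 * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope: cross-products of deviations over squared deviations of x.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    # Intercept: the fitted line passes through (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Invented data lying exactly on the line y = 1 + 2x.
x = [1.0, 2.0, 3.0, 4.0]
y = [3.0, 5.0, 7.0, 9.0]
b0, b1 = simple_regression(x, y)
print(b0, b1)
```

Because the invented points lie exactly on y = 1 + 2x, the function recovers an intercept of 1 and a slope of 2.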
Multiple regression • Similar to simple regression • A single dependent variable (Y) • Two or more independent variables (X) • Multiple correlation (rather than simple) • Estimation by least squares
Multiple regression Simple linear regression (1 dependent variable, 1 independent variable): intercept, slope, error. Multiple linear regression (1 dependent variable, 2 independent variables): intercept, slopes of the independent variables, error.
Multiple regression in matrix form Inverse of X’X
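The matrix form with two predictors can be sketched with NumPy as follows. The data are invented for illustration, and the variable names (`XtX`, `XtX_inv`, `b`) are assumptions rather than the slides' notation:

```python
import numpy as np

# Invented data: 6 observations, two predictors.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y = np.array([3.0, 4.0, 8.0, 7.0, 12.0, 11.0])

# Design matrix: a column of ones (X0) for the intercept, then X1 and X2.
X = np.column_stack([np.ones_like(x1), x1, x2])

XtX = X.T @ X                 # X'X, a 3 x 3 matrix
XtX_inv = np.linalg.inv(XtX)  # inverse of X'X
b = XtX_inv @ X.T @ y         # b = (X'X)^-1 X'y

print(b)  # intercept, slope of X1, slope of X2
```

The explicit inverse mirrors the formula on the slide; in numerical work `np.linalg.solve(XtX, X.T @ y)` or `np.linalg.lstsq` is usually preferred because it is more stable.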
Multiple regression with three predictors In matrix notation this is expressed compactly as:
Sums of squares The least squares method makes it possible to verify the following equality:
Sums of squares Since in general: it is possible to show that the sum of the squared deviations of y from its mean can be decomposed into the sum of squares due to regression and the sum of squares due to error, according to:
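The decomposition can be checked numerically. The sketch below uses invented data and assumes the model includes an intercept column (the decomposition does not hold in general without it); the names `ss_tot`, `ss_reg` and `ss_res` are labels chosen here:

```python
import numpy as np

# Invented data: 6 observations, two predictors.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y = np.array([3.0, 4.0, 8.0, 7.0, 12.0, 11.0])
X = np.column_stack([np.ones_like(x1), x1, x2])

b = np.linalg.solve(X.T @ X, X.T @ y)  # least squares estimate
y_hat = X @ b                          # predicted values

ss_tot = np.sum((y - y.mean()) ** 2)      # total: deviations of y from its mean
ss_reg = np.sum((y_hat - y.mean()) ** 2)  # due to regression
ss_res = np.sum((y - y_hat) ** 2)         # due to error

r2 = ss_reg / ss_tot                      # coefficient of determination
print(ss_tot, ss_reg + ss_res, r2)
```

The first two printed values agree up to rounding error, and R2 is the share of the total sum of squares accounted for by the regression.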
Sums of squares Note the equivalence of:
Sums of squares In summary:
Adjusted R2YY’ Because the coefficient of determination depends on both the number of observations (n) and the number of independent variables (k), it is convenient to correct it for the degrees of freedom. Adjusted R2YY’ In our example:
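A common form of the degrees-of-freedom correction is 1 - (1 - R2)(n - 1)/(n - k - 1). The sketch below assumes that form; the function name `adjusted_r2` and the input values are invented and are not the figures from the slides' example:

```python
def adjusted_r2(r2, n, k):
    """Adjusted R2 for n observations and k independent variables."""
    if n - k - 1 <= 0:
        raise ValueError("need more observations than parameters (n > k + 1)")
    # Rescale the unexplained share by the ratio of degrees of freedom.
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)


# Invented values: R2 = 0.50 with 21 observations and 2 predictors.
value = adjusted_r2(0.50, n=21, k=2)
print(round(value, 4))
```

The adjusted value is never larger than the unadjusted R2, and the gap widens as k grows relative to n.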
Test on b parameters • Once a regression model has been constructed, it may be important to confirm the goodness of fit (R-squared) of the model and the statistical significance of the estimated parameters. Statistical significance can be checked by an F-test of the overall fit, followed by t-tests of individual parameters.
Test on b parameters • You can test the hypothesis that the parameters bi, taken together, differ from 0:
Test on b parameters k = number of columns of the matrix X excluding X0; n = number of observations in y
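With k and n defined as above, the overall F statistic is usually written as the ratio of the regression mean square (k degrees of freedom) to the residual mean square (n - k - 1 degrees of freedom). A minimal sketch under that assumption; the function names and the numbers are invented for illustration:

```python
def overall_f(ss_reg, ss_res, n, k):
    """Return (F, df_reg, df_res) for the test that all slopes are 0."""
    df_reg = k
    df_res = n - k - 1
    ms_reg = ss_reg / df_reg  # mean square due to regression
    ms_res = ss_res / df_res  # mean square due to error
    return ms_reg / ms_res, df_reg, df_res


def overall_f_from_r2(r2, n, k):
    """Same statistic expressed through R2 instead of sums of squares."""
    return (r2 / k) / ((1.0 - r2) / (n - k - 1))


# Invented values: SSreg = 60, SSres = 40, so R2 = 0.6; n = 23, k = 2.
f_value, df1, df2 = overall_f(60.0, 40.0, n=23, k=2)
print(f_value, df1, df2)
```

The statistic is then compared with the F distribution with (k, n - k - 1) degrees of freedom; both functions give the same value because R2 = SSreg / (SSreg + SSres).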
Covariance matrix of the b We denote: An estimate of the covariance matrix of the beta values is given by:
Covariance matrix of the b where the diagonal elements are estimates of the variances of the individual bi.
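The usual estimate multiplies the residual mean square by the inverse of X'X. The sketch below assumes that form, with the residual mean square computed on n - k - 1 degrees of freedom; the data and variable names are invented:

```python
import numpy as np

# Invented data: 6 observations, two predictors.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y = np.array([3.0, 4.0, 8.0, 7.0, 12.0, 11.0])
X = np.column_stack([np.ones_like(x1), x1, x2])

n, p = X.shape  # p counts all columns of X, including X0
k = p - 1       # number of predictors

XtX_inv = np.linalg.inv(X.T @ X)
b = XtX_inv @ X.T @ y
resid = y - X @ b
ms_res = resid @ resid / (n - k - 1)  # residual mean square

cov_b = ms_res * XtX_inv  # estimated covariance matrix of b
var_b = np.diag(cov_b)    # diagonal: variance of each bi
se_b = np.sqrt(var_b)     # standard error of each bi
print(cov_b)
print(se_b)
```

The off-diagonal elements of `cov_b` are the estimated covariances between pairs of coefficients.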
Standard error of the b The standard error of the parameters can be calculated with the following formula: where cii is the diagonal element of the matrix (X’X)-1 corresponding to the parameter bi.
Standard error of the b Note: when the value of cii is large, the value of sebi increases, indicating that the variable Xi has a high multiple correlation coefficient with the other X variables.
Standard error of the b The standard error of bi can also be calculated in the following way: where an increase in R2i (the squared multiple correlation of Xi with the other X variables) decreases the denominator of the ratio and, consequently, increases the standard error of the parameter bi.
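The two routes to the standard error can be compared numerically. The sketch assumes the forms sqrt(MSres * cii) and sqrt(MSres / (SSxi * (1 - R2i))), where SSxi is the sum of squared deviations of Xi from its mean and R2i comes from regressing Xi on the other columns of X; the helper `r2_on_others` and the data are invented for illustration:

```python
import numpy as np

# Invented data: 6 observations, two predictors.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y = np.array([3.0, 4.0, 8.0, 7.0, 12.0, 11.0])
X = np.column_stack([np.ones_like(x1), x1, x2])
n, p = X.shape
k = p - 1

XtX_inv = np.linalg.inv(X.T @ X)
b = XtX_inv @ X.T @ y
ms_res = np.sum((y - X @ b) ** 2) / (n - k - 1)

# Route 1: diagonal elements cii of (X'X)^-1.
se_cii = np.sqrt(ms_res * np.diag(XtX_inv))


def r2_on_others(X, i):
    """R2 from regressing column i of X on the remaining columns (X0 included)."""
    target = X[:, i]
    others = np.delete(X, i, axis=1)
    coef = np.linalg.lstsq(others, target, rcond=None)[0]
    ss_tot = np.sum((target - target.mean()) ** 2)
    ss_err = np.sum((target - others @ coef) ** 2)
    return 1.0 - ss_err / ss_tot


# Route 2: through R2i, for the slopes only (X0 is skipped).
se_r2 = []
for i in range(1, p):
    ss_xi = np.sum((X[:, i] - X[:, i].mean()) ** 2)
    se_r2.append(np.sqrt(ms_res / (ss_xi * (1.0 - r2_on_others(X, i)))))
se_r2 = np.array(se_r2)

print(se_cii[1:], se_r2)
```

The two routes agree for every slope, which makes the role of collinearity explicit: as R2i approaches 1 the denominator shrinks and the standard error grows.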
Tests on individual predictors • With the standard error associated with each bi you can perform a t-test to verify:
Tests on individual predictors With the standard error associated with each bi it is also possible to estimate the confidence interval for each parameter:
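In their usual form, the t statistic and the confidence interval are written as below. This is a sketch of the standard expressions, assuming the null hypothesis that the population coefficient is 0 and n - k - 1 residual degrees of freedom; the symbols beta_i and alpha are notation chosen here:

```latex
\[
H_0:\ \beta_i = 0, \qquad
t_i = \frac{b_i}{se_{b_i}}, \qquad
df = n - k - 1
\]
\[
b_i \;\pm\; t_{\alpha/2,\; n-k-1}\; se_{b_i}
\]
```

If the interval does not contain 0, the two-sided t-test rejects the null hypothesis at level alpha.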
Tests on individual predictors To conduct a statistical test on a regression coefficient it is necessary to: • Calculate SSreg for the model containing all the independent variables. • Calculate SSreg for the model excluding the variable whose significance you want to test (SS-i). • Perform an F-test with numerator equal to the difference SSreg - SS-i divided by the difference between the degrees of freedom of the two models, and with denominator SSres / (n - k - 1).
Tests on individual predictors To test, for example, only the weight of the first predictor relative to the total model, it is necessary to calculate a new vector b-i from the matrix X-i, obtained by removing from X the column belonging to the first predictor. The calculation of SS-i follows immediately.
Tests on individual predictors Similarly we have: The same procedure is followed to test any subset of predictors.
Tests on individual predictors It is interesting to note that this test on a single predictor is equivalent to the t-test of b1 = 0. When the numerator has only one degree of freedom, the following equivalence holds:
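The procedure and the equivalence with the t-test can be sketched as follows: fit the full model, refit without the column of the first predictor, form the F ratio, and compare it with the square of the t statistic for b1. The data and the helper `ss_regression` are invented for illustration:

```python
import numpy as np


def ss_regression(X, y):
    """Sum of squares due to regression for the model with design matrix X."""
    coef = np.linalg.lstsq(X, y, rcond=None)[0]
    return np.sum((X @ coef - y.mean()) ** 2)


# Invented data: 6 observations, two predictors.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y = np.array([3.0, 4.0, 8.0, 7.0, 12.0, 11.0])
X = np.column_stack([np.ones_like(x1), x1, x2])
n, p = X.shape
k = p - 1

XtX_inv = np.linalg.inv(X.T @ X)
b = XtX_inv @ X.T @ y
ms_res = np.sum((y - X @ b) ** 2) / (n - k - 1)

# Reduced model: remove the column of the first predictor (index 1 in X).
X_minus_1 = np.delete(X, 1, axis=1)
ss_full = ss_regression(X, y)
ss_minus_1 = ss_regression(X_minus_1, y)
df_diff = X.shape[1] - X_minus_1.shape[1]  # 1 degree of freedom

F = ((ss_full - ss_minus_1) / df_diff) / ms_res

# t-test of b1 = 0 using the standard error from (X'X)^-1.
t = b[1] / np.sqrt(ms_res * XtX_inv[1, 1])
print(F, t ** 2)
```

With one degree of freedom in the numerator the two printed values coincide, so F = t squared. Removing several columns at once gives the test for a subset of predictors, with `df_diff` equal to the number of columns removed.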
Summary table In this example, none of the estimated parameters reached statistical significance for the hypothesis bi ≠ 0.
Variance of individual predictors Xi Using the matrix X'X we can calculate the variance of each variable Xi.
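When X has a leading column of ones, X'X already holds the ingredients: n in the top-left cell, the sums of each Xi in the first row, and the sums of squares of each Xi on the diagonal. A sketch under that assumption, using the sample variance with n - 1 in the denominator and invented data:

```python
import numpy as np

# Invented data: 6 observations, two predictors, X0 = column of ones.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
X = np.column_stack([np.ones_like(x1), x1, x2])

XtX = X.T @ X
n = XtX[0, 0]             # sum of ones = number of observations
sums = XtX[0, 1:]         # first row: sum of each Xi
sum_sq = np.diag(XtX)[1:]  # diagonal: sum of squares of each Xi

# Variance of Xi = (sum of squares - (sum)^2 / n) / (n - 1)
var_x = (sum_sq - sums ** 2 / n) / (n - 1)
print(var_x)
```

The result matches the variance computed directly from the columns of X, for example with `X[:, 1:].var(axis=0, ddof=1)`.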