第4章n 多元回归估计与假设检验.ppt
《第4章n 多元回归估计与假设检验.ppt》由会员分享,可在线阅读,更多相关《第4章n 多元回归估计与假设检验.ppt(88页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、第4章n多元回归估计与假设检验 Still waters run deep.流静水深流静水深,人静心深人静心深 Where there is life,there is hope。有生命必有希望。有生命必有希望2Parallels with Simple RegressionYi=b0+b1Xi1+b2Xi2+.bkXik+uib0 is still the interceptb1 to bk all called slope parameters,also called partial regression coefficients and any coefficient bj denote
2、 the change of Y with the changes of Xj as all the other independent variables fixed.u is still the error term(or disturbance)Still minimizing the sum of squared residuals,so have k+1 first order conditions3Obtaining OLS Estimates4Obtaining OLS Estimates,cont.The above estimated equation is called t
3、he OLS regression line or the sample regression function(SRF)the above equation is the estimated equation,is not the really equation.The really equation is population regression line which we dont know.We only estimate it.So,using a different sample,we can get another different estimated equation li
4、ne.The population regression line is5Interpreting Multiple Regression6An Example(Wooldridge,p76)The determination of wage(dollars per hour),wage:Years of education,educYears of labor market experience,experYears with the current employer,tenureThe relationship btw.wage and educ,exper,tenure:wage=b0+
5、b1educ+b2exper+b3tenure+uThe estimated equation as below:wage=-2.873+0.599educ+0.022exper+0.169tenure7A“Partialling Out”Interpretation8A“Partialling Out”Interpretation9“Partialling Out”continuedPrevious equation implies that regressing Y on X1 and X2 gives same effect of X1 as regressing Y on residu
6、als from a regression of X1 on X2This means only the part of Xi1 that is uncorrelated with Xi2 are being related to Yi,so were estimating the effect of X1 on Y after X2 has been“partialled out”10The wage determinationsThe estimated equation as below:wage=-2.873+0.599educ+0.022exper+0.169tenureNow,we
7、 first regress educ on exper and tenure to patial out the exper and tenures effects.Then we regress wage on the residuals of educ on exper and tenure.Whether we get the same result.?educ=13.575-0.0738exper+0.048tenure denote residuals residwage=5.896+0.599residWe can see that the coefficient of resi
8、d is the same of the coefficien of the variable educ in the first estimated equation.So is in the second equation.11Goodness-of-Fit:R212Goodness-of-Fit13Goodness-of-Fit(continued)How do we think about how well our sample regression line fits our sample data?Can compute the fraction of the total sum
9、of squares(SST)that is explained by the model,call this the R-squared of regression R2=ESS/TSS=1 RSS/TSS14Goodness-of-Fit(continued)15More about R-squaredwR2 can never decrease when another independent variable is added to a regression,and usually will increasewBecause R2 will usually increase with
10、the number of independent variables,it is not a good way to compare models16An ExamplewUsing wage determination model to show that when we add another new independent variable will increase the value of R2.17Adjusted R-SquaredwR2 is simply an estimate of how much variation in y is explained by X1,X2
11、,Xk.That is,wRecall that the R2 will always increase as more variables are added to the modelwThe adjusted R2 takes into account the number of variables in a model,and may decrease18Adjusted R-Squared(cont)wMost packages will give you both R2 and adj-R2w You can compare the fit of 2 models(with the
12、same Y)by comparing the adj-R2awge=-3.391+0.644educ+0.070exper adj-R2=0.2222awge=-2.222+0.569educ+0.190tenure adj-R2=0.2992w You cannot use the adj-R2 to compare models with different ys(e.g.y vs.ln(Y)awge=-3.391+0.644educ+0.070exper adj-R2=0.2222alog(wge)=0.404+0.087educ+0.026exper adj-R2=0.3059aBe
13、cause the variance of the dependent variables is different,the comparation btw them make no sense.19Assumptions for Unbiasedness20Assumptions for UnbiasednessPopulation model is linear in parameters:Y=b0+b1X1+b2X2+bkXk+uWe can use a sample of size n,(Xi1,Xi2,Xik,Yi):i=1,2,n,from the population model
14、,so that the sample model is Yi=b0+b1Xi1+b2Xi2+bkXik+ui Cov(uXi)=0,E(uXi)=0,i=1,2,n.E(u|X1,X2,Xk)=0,implying that all of the explanatory variables are exogenous.E(u|X)=0,where X=(X1,X2,Xk),which will reduce to E(u)=0 if independent variables X are not random variables.None of the Xs is constant,and
15、there are no exact linear relationships among them.The new additional assumption.21About multicollinearityIt does allow the independent variables to be correlated;they just cannot be perfectly linear correlated.Student performance:colGPA=b0+b1 hsGPA+b2ACT+b3 skipped+uConsumption function:consum=b0+b
16、1inc+b2inc2+uBut,the following is invalid:log(consum)=b0+b1log(inc)+b2log(inc2)+uIn this case,we can not estimate the regression coefficients b1,b2.22Unbiasedness of OLS estimationUnder the three assumptions above,we can get23Too Many or Too Few Variables24Too Many or Too Few VariablesWhat happens i
17、f we include variables in our specification that dont belong?There is no effect on our parameter estimate,and OLS remains unbiasedWhat if we exclude a variable from our specification that does belong?OLS will usually be biased 25Omitted Variable Bias26Omitted Variable Bias(cont)27Omitted Variable Bi
18、as(cont)28Omitted Variable Bias(cont)There are two cases where the estimated parameter is unbiased:If b2=0,so that X2 does not appear in the true modelIf tilde of d1=0,the tilde b1 is unbiased for b1 29Summary of Direction of BiasCorr(X1,X2)0Corr(X1,X2)0Positive biasNegative biasb2 0 and H1:bj 0One-
19、Sided Alternatives(cont)0ca(1-a)Fail to rejectreject58An Example:Hourly Wage EquationwWage determination:(wooldridge,p123)wlog(wge)=0.284+0.092educ+0.0041exper+0.022tenurew (0.104)(0.007)(0.0017)(0.003)w n=526 R2=0.316wWhether the return to exper,controlling for educ and tenure,is zero in the popula
20、tion,against the alternative that it is positive.wH0:bexper=0 vs.H1:bexper 0wThe t statistic is t=0.0041/0.00172.41wThe degree of freedom:df=n-k-1=526-3-1=522wThe critical value of 5%is 1.645wAnd the t statistic is larger than the critical value,ie.,2.411.645wThat is,we will reject the null hypothes
21、is and bexper is really positive.01.645(1-a)Fail to reject5%reject59Another example:Student Performance and School SizewWhether the school size has effect on student performance?amath10,math test scores,reveal the student performanceatotcomp,average annual teacher compensationastaff,the number of st
22、aff per one thousand studentsaenroll,student enrollment,reveal the school size.wThe Model Equationamath10=b0+b1totcomp+b2staff+b3enroll+uaH0:b3=0,H1:b3-1.645,so we cant reject the null hypothesis.-1.645reject-09160One-sided vs Two-sidedwBecause the t distribution is symmetric,testing H1:bj 0 is stra
23、ightforward.The critical value is just the negative of beforewWe can reject the null if the t statistic c,then we fail to reject the nullwFor a two-sided test,we set the critical value based on a/2 and reject H0:bj=0 if the absolute value of the t statistic c61yi =b0 +b1Xi1 +bkXik+uiH0:bj=0 H1:bj 0c
24、0a/2(1-a)-ca/2Two-Sided Alternativesrejectrejectfail to reject62Summary for H0:b bj=0wUnless otherwise stated,the alternative is assumed to be two-sidedwIf we reject the null,we typically say“Xj is statistically significant at the 100a%level”wIf we fail to reject the null,we typically say“Xj is stat
25、istically insignificant at the 100a%level”63An Example:Determinants of College GPA(wooldridge,p128)wVariables:acolGPA,college GPAaskipped,the average number of lectures missed per weekaACT,achievement test scoreahsGPA,high school GPAwThe estimated modelaolGPA=1.39+0.412 hsGPA+0.015 ACT 0.083 skipped
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 多元 回归 估计 假设检验
限制150内