n 多元回归估计与假设检验.pptx
《n 多元回归估计与假设检验.pptx》由会员分享,可在线阅读,更多相关《n 多元回归估计与假设检验.pptx(88页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、1Parallels with Simple RegressionYi=b0+b1Xi1+b2Xi2+.bkXik+uib0 is still the interceptb1 to bk all called slope parameters,also called partial regression coefficients and any coefficient bj denote the change of Y with the changes of Xj as all the other independent variables fixed.u is still the error
2、 term(or disturbance)Still minimizing the sum of squared residuals,so have k+1 first order conditions第1页/共88页2Obtaining OLS Estimates第2页/共88页3Obtaining OLS Estimates,cont.The above estimated equation is called the OLS regression line or the sample regression function(SRF)the above equation is the es
3、timated equation,is not the really equation.The really equation is population regression line which we dont know.We only estimate it.So,using a different sample,we can get another different estimated equation line.The population regression line is第3页/共88页4Interpreting Multiple Regression第4页/共88页5An
4、Example(Wooldridge,p76)The determination of wage(dollars per hour),wage:Years of education,educYears of labor market experience,experYears with the current employer,tenureThe relationship btw.wage and educ,exper,tenure:wage=b0+b1educ+b2exper+b3tenure+uThe estimated equation as below:wage=educ+exper+
5、tenure第5页/共88页6A“Partialling Out”Interpretation第6页/共88页7A“Partialling Out”Interpretation第7页/共88页8“Partialling Out”continuedPrevious equation implies that regressing Y on X1 and X2 gives same effect of X1 as regressing Y on residuals from a regression of X1 on X2This means only the part of Xi1 that i
6、s uncorrelated with Xi2 are being related to Yi,so were estimating the effect of X1 on Y after X2 has been“partialled out”第8页/共88页9The wage determinationsThe estimated equation as below:wage=-educ+exper+tenureNow,we first regress educ on exper and tenure to patial out the exper and tenures effects.T
7、hen we regress wage on the residuals of educ on exper and tenure.Whether we get the same result.?educ=exper+tenure denote residuals residwageresidWe can see that the coefficient of resid is the same of the coefficien of the variable educ in the first estimated equation.So is in the second equation.第
8、9页/共88页10Goodness-of-Fit:R2第10页/共88页11Goodness-of-Fit第11页/共88页12Goodness-of-Fit(continued)How do we think about how well our sample regression line fits our sample data?Can compute the fraction of the total sum of squares(SST)that is explained by the model,call this the R-squared of regression R2=ES
9、S/TSS=1 RSS/TSS第12页/共88页13Goodness-of-Fit(continued)第13页/共88页14More about R-squaredR2 can never decrease when another independent variable is added to a regression,and usually will increaseBecause R2 will usually increase with the number of independent variables,it is not a good way to compare model
10、s第14页/共88页15An ExampleUsing wage determination model to show that when we add another new independent variable will increase the value of R2.第15页/共88页16Adjusted R-SquaredR2 is simply an estimate of how much variation in y is explained by X1,X2,Xk.That is,Recall that the R2 will always increase as mo
11、re variables are added to the modelThe adjusted R2 takes into account the number of variables in a model,and may decrease第16页/共88页17Adjusted R-Squared(cont)Most packages will give you both R2 and adj-R2 You can compare the fit of 2 models(with the same Y)by comparing the adj-R2wge=-+educ+exper adj-R
12、2wge=-educ+tenure adj-R2 You cannot use the adj-R2 to compare models with different ys(e.g.y vs.ln(Y)wge=-+educ+exper adj-R2log(wge)=+educ+exper adj-R2Because the variance of the dependent variables is different,the comparation btw them make no sense.第17页/共88页18Assumptions for Unbiasedness第18页/共88页1
13、9Assumptions for UnbiasednessPopulation model is linear in parameters:Y=b0+b1X1+b2X2+bkXk+uWe can use a sample of size n,(Xi1,Xi2,Xik,Yi):i=1,2,n,from the population model,so that the sample model is Yi=b0+b1Xi1+b2Xi2+bkXik+ui Cov(uXi)=0,E(uXi)=0,i=1,2,n.E(u|X1,X2,Xk)=0,implying that all of the expl
14、anatory variables are exogenous.E(u|X)=0,where X=(X1,X2,Xk),which will reduce to E(u)=0 if independent variables X are not random variables.None of the Xs is constant,and there are no exact linear relationships among them.The new additional assumption.第19页/共88页20About multicollinearityIt does allow
15、the independent variables to be correlated;they just cannot be perfectly linear correlated.Student performance:colGPA=b0+b1 hsGPA+b2ACT+b3 skipped+uConsumption function:consum=b0+b1inc+b2inc2+uBut,the following is invalid:log(consum)=b0+b1log(inc)+b2log(inc2)+uIn this case,we can not estimate the re
16、gression coefficients b1,b2.第20页/共88页21Unbiasedness of OLS estimationUnder the three assumptions above,we can get第21页/共88页22Too Many or Too Few Variables第22页/共88页23Too Many or Too Few VariablesWhat happens if we include variables in our specification that dont belong?There is no effect on our parame
17、ter estimate,and OLS remains unbiasedWhat if we exclude a variable from our specification that does belong?OLS will usually be biased 第23页/共88页24Omitted Variable Bias第24页/共88页25Omitted Variable Bias(cont)第25页/共88页26Omitted Variable Bias(cont)第26页/共88页27Omitted Variable Bias(cont)There are two cases
18、where the estimated parameter is unbiased:If b2=0,so that X2 does not appear in the true modelIf tilde of d1=0,the tilde b1 is unbiased for b1 第27页/共88页28Summary of Direction of BiasCorr(X1,X2)0Corr(X1,X2)0Positive biasNegative biasb2 0 and H1:bj 0One-Sided Alternatives(cont)0ca(1-a)Fail to rejectre
19、ject第56页/共88页57An Example:Hourly Wage EquationWage determination:(wooldridge,p123)log(wgeeduc+exper+tenure (0.104)(0.007)(0.0017)(0.003)n=526 R2Whether the return to exper,controlling for educ and tenure,is zero in the population,against the alternative that it is positive.H0:bexper=0 vs.H1:bexper 0
20、The t statistic is tThe degree of freedom:df=n-k-1=526-3-1=522That is,we will reject the null hypothesis and bexper is really positive.01.645(1-a)Fail to reject5%reject第57页/共88页58Another example:Student Performance and School SizeWhether the school size has effect on student performance?math10,math
21、test scores,reveal the student performancetotcomp,average annual teacher compensationstaff,the number of staff per one thousand studentsenroll,student enrollment,reveal the school size.The Model Equationmath10=b0+b1totcomp+b2staff+b3enroll+uH0:b3=0,H1:b30The Estimated Equationmath10=2.274+0.00046 to
22、tcomp+0.048 staff-0.00020 enroll (6.113)(0.00010)(0.040)(0.00022)n=408,R2=df=408-3-1=404,t -0.91,c,so we cant reject the null hypothesis.-1.645reject-091第58页/共88页59One-sided vs Two-sidedBecause the t distribution is symmetric,testing H1:bj 0 is straightforward.The critical value is just the negative
23、 of beforeWe can reject the null if the t statistic c,then we fail to reject the nullFor a two-sided test,we set the critical value based on a/2 and reject H0:bj=0 if the absolute value of the t statistic c第59页/共88页60yi =b0 +b1Xi1 +bkXik+uiH0:bj=0 H1:bj 0c0a/2(1-a)-ca/2Two-Sided Alternativesrejectre
24、jectfail to reject第60页/共88页61Summary for H0:b bj=0Unless otherwise stated,the alternative is assumed to be two-sidedIf we reject the null,we typically say“Xj is statistically significant at the 100a%level”If we fail to reject the null,we typically say“Xj is statistically insignificant at the 100a%le
25、vel”第61页/共88页62An Example:Determinants of College GPA(wooldridge,p128)Variables:colGPA,college GPAskipped,the average number of lectures missed per weekACT,achievement test scorehsGPA,high school GPAThe estimated modelolGPA=1.39+0.412 hsGPA+0.015 ACT 0.083 skipped (0.33)(0.094)(0.011)(0.026)n=141,R2
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 多元 回归 估计 假设检验
限制150内