The unobserved heterogeneity distribution in duration analysis.docx
《The unobserved heterogeneity distribution in duration analysis.docx》由会员分享,可在线阅读,更多相关《The unobserved heterogeneity distribution in duration analysis.docx(14页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、Biometrika (2007), 94, 1, pp. 8799 doi:10.1093/biomet/asm013 2007 Biometrika Trust Printed in Great Britain The unobserved heterogeneity distribution in duration analysis BY JAAP H. ABBRING AND GERARD J. VAN DEN BERG Department of Economics, Free University Amsterdam, De Boelelaan 1105, 1081 HV Amst
2、erdam, The Netherlands jabbringfeweb.vu.nl gbergfeweb.vu.nl SUMMARY In a large class of hazard models with proportional unobserved heterogeneity, the distribution of the heterogeneity among survivors converges to a gamma distribution. This convergence is often rapid. We derive this result as a gener
3、al result for exponential mixtures and explore its implications for the specification and empirical analysis of univariate and multivariate duration models. Some key words: Duration analysis; Exponential mixture; Gamma distribution; Limit distribution; Mixed proportional hazard. 1. INTRODUCTION It i
4、s well known that duration analysis produces incorrect results if unobserved heterogeneity is ignored (Lancaster, 1990). On average, subjects with relatively high hazard rates for unobserved reasons leave the state of interest first, so that samples of survivors are selected. Differences between suc
5、h samples at different times reflect behavioural differences as well as this selection effect. Lancaster (1979) specified and estimated a proportional hazard model with multiplicative unobserved heterogeneity. This is called a mixed proportional hazard model and has subsequently become by far the mo
6、st popular duration model in econometrics. Van den Berg (2001) presents a survey. The model is typically estimated using methods that require parametric functional form assumptions on the heterogeneity distribution. Lancaster (1979) assumes a gamma distribution, as do Vaupel et al. (1979), who intro
7、duced the model in demography. Nickell (1979) assumes a discrete distribution, and others have made other choices (Van den Berg, 2001). Unfortunately, estimators of the mixed proportional hazard model are usually biased if the functional form of the heterogeneity distribution is misspecified. Extens
8、ive simulation evidence is provided by, for example, Baker & Melino (2000) and Bretagnolle & Huber- Carol (1988). Also, many empirical studies report that the estimates are sensitive to the functional form of the distribution (Heckman & Singer, 1984; Trussell & Richards, 1985; Hougaard et al., 1994;
9、 Keiding et al., 1997). As a result, studies in which mixed proportional hazard models are estimated have wrestled with the choice of a functional form for the heterogeneity distribution; see for example Heckman & Singer (1984). In general, there is no argument in favour of one choice over the other
10、. Also, formal results in the methodological studies by Heckman & Taber (1994), Kortram et al. (1995) and Horowitz (1999) indicate that duration data are rather uninformative about the shape of this distribution. In practice, researchers often choose a gamma mixing distribution for computational and
11、 expositional reasons; 88 JAAP H. ABBRING AND GERARD J. VAN DEN BERG all functions of interest have simple explicit expressions in this case (Lancaster, 1990). The mixed proportional hazard model with gamma heterogeneity is a preferred option in popular statistical packages like STATA, SAS, S-Plus a
12、nd SPSS. Recently developed semiparametric estimators for the model also assume gamma heterogeneity; for examples see Clayton (1978), Meyer (1990), Nielsen et al. (1992), Murphy (1994, 1995), Petersen et al. (1996) and references in Andersen et al. (1993). The results in this paper rationalize this
13、preference for the gamma distribution, and connect the many results that have been derived for the gamma case to a wider class of models. 2. A LIMIT RESULT FOR EXPONENTIAL MIXTURES 21. Exponential mixtures Let Z and V be nonnegative random variables such that pr(Z z|V) = exp(V z). (1) The marginal d
14、istribution of Z is therefore a mixture of exponential distributions with respect to the marginal distribution F of V : pr(Z z) = exp(vz)dF (v). 0 We examine the limiting behaviour of the distribution of V conditional on Z z as z . In particular, we examine the limiting behaviour of Gz(v) = pr (zV v
15、|Z z) . 22. Main result We adopt the definitions of Feller (1971, VIII8) of slow variation and regular variation at 0. DEFINITION 1. A positive function L defined on (0, ) is slowly varying at 0 if limy0 L(y)/L(y) = 1 for every fixed 0. DEFINITION 2. A positive function k defined on (0, ) is regular
16、ly varying with exponent 0, at v. We define the standard gamma distribution as := 1, , with density denoted by . Finally, we define the limiting case 0 such that 0(v) = 1 for all v 0, ). This is a degenerate distribution with all probability mass at zero. We now state the main result. Unobserved het
17、erogeneity distribution 89 z z z PROPOSITION 1. If Gz G as z , with G a proper distribution function, then G = for some 0. A necessary and sufficient condition for Gz ( 0) is that F is regularly varying with exponent at 0. Proof. The Laplace transform LGz of Gz is given by LGz (s) = exp(sv)dGz(v) =
18、LF z(s + 1) . 0 LF (z) First, suppose that Gz G as z , with G a proper distribution function, and denote the Laplace transform of G by LG. Then LGz LG as z by the continuity of the Laplace transform. Thus, lim LGz (s) = lim LF z(s + 1) z z LF (z) exists and is positive and nonincreasing on (0, ). By
19、 Feller (1971, VIII8, Lemma 1), the latter limit then necessarily equals (s + 1) for some 0. In turn, this implies that G = for some 0. Secondly, again by continuity of the Laplace transform, z(s + 1) Gz lim LF LF (z) = (s + 1), so that Gz if and only if LF is regularly varying with exponent at infi
20、nity. In turn, it follows from an Abelian/Tauberian theorem, like Theorem 3 of Feller (1971, XIII5), that this is true if and and only if F varies regularly with exponent at 0. Examples of continuous distributions that are regularly varying at 0 with exponent 0 are all distributions with densities t
21、hat have finite positive limits at 0, such as the exponential, uniform and truncated normal distributions, and all gamma and beta distributions. Examples with 0 also include some discrete distributions with dense support near 0. The case = 0 includes all distributions, including finitely discrete di
22、stributions, with a point mass at 0. An obvious example of a distribution that is not regularly varying at 0 is a distribution without support near 0. Let v0 := infv : F (v) 0 be the largest lower bound on the support of F . Let F 0 be the distribution of V v0 and G0 the distribution of z(V v0) cond
23、itional on Z z. Then Proposition 1 applies without change with F replaced by F 0 and Gz replaced by G0. 23. Speed of convergence In statistical applications results about the rate of convergence of Gz to G would be useful. The following example shows that no general result about this rate can be der
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- The unobserved heterogeneity distribution in duration analysis
链接地址:https://www.taowenge.com/p-8024.html
限制150内