Content deleted Content added
→top: tagging for cleanup |
KuyaMoHirowo (talk | contribs) m Fixed lint errors: missing end tag |
||
(14 intermediate revisions by 7 users not shown) | |||
Line 1:
{{Multiple issues|{{one source|date=November 2015}}{{
{{merge to|Maximum likelihood estimation|date=October 2017}}▼
▲{{Multiple issues|{{one source|date=November 2015}}{{sections|date=November 2015}}
{{Cleanup|reason=Formatting of mathematical formulas.|date=March 2018}}
}}
Partial (pooled) likelihood estimation for [[panel data]] is a [[Quasi-maximum likelihood estimate|quasi-maximum likelihood]] method for [[panel analysis]] that assumes that density of
==Description== Concretely, partial likelihood estimation uses the product of conditional densities as the density of the joint conditional distribution. This generality facilitates [[maximum likelihood]] methods in panel data setting because fully specifying conditional distribution of ''y<sub>i</sub>'' can be computationally demanding.<ref name= "Woolridge">Wooldridge, J.M., Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.</ref> On the other hand, allowing for misspecification generally results in violation of information equality and thus requires robust [[standard error estimator]] for inference. In the following exposition, we follow the treatment in Wooldridge.<ref name= "Woolridge" /> Particularly, the asymptotic derivation is done under fixed-T, growing-N setting.
Line 11 ⟶ 12:
Writing the conditional density of y<sub>it</sub> given ''x<sub>it</sub>'' as ''f<sub>t</sub>'' (''y<sub>it</sub>'' | ''x<sub>it</sub>'';θ), the partial maximum likelihood estimator solves:
:
▲ \underset{\theta\in\Theta}{\operatorname{max}}\sum_{i=1}^N\sum_{t=1}^T \log f_t(y_{it} \mid x_{it}; \theta) </math>
In this formulation, the joint conditional density of ''y<sub>i</sub>'' given ''x<sub>i</sub>'' is modeled as ''Π<sub>t</sub>'' ''f<sub>t</sub>'' (''y<sub>it</sub>'' | ''x<sub>it</sub>'' ; θ). We assume that ''f<sub>t</sub> (y<sub>it</sub> |x<sub>it</sub> ; θ)'' is correctly specified for each ''t'' = 1,
But, it is not assumed that the joint conditional density is correctly specified. Under some regularity conditions, partial MLE is consistent and asymptotically normal.
By the usual argument for [[M-estimator]]s (details in Wooldridge <ref name= "Woolridge" />), the asymptotic variance of {{radic|''N''}} ''(θ<sub>MLE</sub>- θ<sub>0</sub>) is A<sup>−1</sup> BA<sup>−1</sup>'' where ''A<sup>−1</sup> = E[
== Pooled QMLE for Poisson models==
Pooled QMLE is a technique that allows estimating parameters when [[panel data]] is available with Poisson outcomes. For instance, one might have information on the number of patents files by a number of different firms over time. Pooled QMLE does not necessarily contain [[unobserved effects]] (which can be either [[random effects]] or [[fixed effects]]), and the estimation method is mainly proposed for these purposes. The computational requirements are less stringent, especially compared to [[fixed-effect Poisson model]]s, but the trade off is the possibly strong assumption of no [[unobserved heterogeneity]]. Pooled refers to pooling the data over the different time periods ''T'', while QMLE refers to the quasi-maximum likelihood technique.
The [[Poisson distribution]] of <math>y_i</math> given <math>x_i</math> is specified as follows:<ref name="CameronTrivedi">Cameron, C. A. and P. K. Trivedi (2015) Count Panel Data, Oxford Handbook of Panel Data, ed. by B. Baltagi, Oxford University Press, pp. 233–256</ref>
: <math>f(y_i \mid x_i ) = \frac{e^{-\mu_i} \mu_i^{y_i}}{y_i!}</math>
the starting point for Poisson pooled QMLE is the conditional mean assumption. Specifically, we assume that for some <math>b_0</math> in a compact parameter space '''B''', the conditional mean is given by<ref name="CameronTrivedi"/>
: <math> \operatorname E[y_t \mid x_t]=m(x_t, b_0) = \mu_t \text{ for } t= 1,\ldots, T. </math>
The compact parameter space condition is imposed to enable the use of [[M-estimator|M-estimation techniques]], while the conditional mean reflects the fact that the population mean of a Poisson process is the parameter of interest. In this particular case, the parameter governing the Poisson process is allowed to vary with respect to the vector <math>x_{t}\centerdot</math>.<ref name="CameronTrivedi"/> The function ''m'' can, in principle, change over time even though it is often specified as static over time.<ref name="Woolbridge2002">Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.</ref> Note that only the conditional mean function is specified, and we will get consistent estimates of <math>b_{0}</math> as long as this mean condition is correctly specified. This leads to the following first order condition, which represents the quasi-log likelihood for the pooled Poisson estimation:<ref name="CameronTrivedi"/><!--not sure if the correct ref is this one or the previous one-->
: <math>\ell_i(b)=\sum[y_{it} \log(m(x_{it},b))-m(x_{it},b)]</math>
A popular choice is <math>m=(x_t,b_0)=\exp(x_t b_0)</math>, as Poisson processes are defined over the positive real line.<ref name="Woolbridge2002"/> This reduces the conditional moment to an exponential index function, where <math>x_t b_0</math> is the linear index and exp is the link function.<ref>McCullagh, P. and J. A. Nelder (1989): Generalized Linear Models, CRC Monographs on Statistics and Applied Probability (Book 37), 2nd Edition, Chapman and Hall, London.</ref>
==References==
{{Reflist}}
{{Statistics}}
[[Category:M-estimators]]
[[Category:Panel data]]
[[Category:Probability distribution fitting]]
|