Zero-inflated Conway-Maxwell Poisson Distribution to Analyze Discrete Data

Shin Zhu Sim; Ramesh C. Gupta; Seng Huat Ong

doi:10.1515/ijb-2016-0070


Published/Copyright: January 9, 2018


From the journal The International Journal of Biostatistics, Volume 14, Issue 1

Abstract

In this paper, we study the zero-inflated Conway-Maxwell Poisson (ZICMP) distribution and develop a regression model based on it. Score and likelihood ratio tests are implemented for testing the inflation/deflation parameter, and simulation studies are carried out to examine the performance of these tests. A data example illustrates the concepts. In this example, the proposed model is compared with the well-known zero-inflated Poisson (ZIP) and zero-inflated generalized Poisson (ZIGP) regression models. The fit by ZICMP is shown to be comparable to or better than the fits of these models.

Keywords: zero-inflation/deflation; structural zeros; Poisson; generalized Poisson; regression; score and likelihood ratio tests; over and under-dispersion

Funding statement: This work was supported by the Fundamental Research Grant Scheme, Ministry of Higher Education, Malaysia [FP045-2015A] and the University of Malaya’s Research Grant Scheme [RP009A-13AFR].

Acknowledgements

We wish to thank the referees for their insightful comments which have vastly improved the paper.

Appendix

(A) Score and Likelihood Ratio Tests

Let $Z_1, Z_2, \ldots, Z_n$ be $n$ random variables with pmf $p(z;\theta)$ and parameters $\theta = (\theta_1, \theta_2, \ldots, \theta_q)$, and let $L(\theta; z)$ be the likelihood function.

The hypothesis of interest

$$H_0: \theta_1 = \theta_{10},\ \theta_2 = \theta_{20},\ \ldots,\ \theta_k = \theta_{k0};\quad \theta_{k+1}, \theta_{k+2}, \ldots, \theta_q \ \text{unspecified}$$

against the alternative

$$H_1: \theta = (\theta_1, \theta_2, \ldots, \theta_q) \ \text{unspecified}$$

can be tested by using the score and likelihood ratio (LR) tests, where $\hat{\theta}^{*}$ and $\hat{\theta}$ are the maximum likelihood estimates under $H_0$ and $H_1$, respectively. The test statistics of these tests are summarized below.

Likelihood Ratio Test

$$-2\ln\lambda = -2\ln\left[\frac{L(\hat{\theta}^{*}; z)}{L(\hat{\theta}; z)}\right]$$

Score Test

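$$S_c = U^{*T}\,\Gamma^{*-1}\,U^{*}$$

where

$$U^{*T} = \bigl(u_1(\theta), u_2(\theta), \ldots, u_q(\theta)\bigr)\Big|_{\theta=\hat{\theta}^{*}}, \qquad u_i(\theta) = \frac{\partial \ln L(\theta; z)}{\partial \theta_i}, \quad i = 1, 2, \ldots, q,$$

and

$$\Gamma^{*} = \bigl(\Gamma^{*}_{ij}\bigr) = E\left[-\frac{\partial^2 \ln L(\theta; z)}{\partial \theta_i\,\partial \theta_j}\right]\Bigg|_{\theta=\hat{\theta}^{*}}, \quad i, j = 1, 2, \ldots, q.$$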

Under $H_0$, each of these test statistics is asymptotically $\chi^2$ distributed with $k$ degrees of freedom.
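As a minimal numerical sketch of the two statistics, consider the simplest possible setting: an i.i.d. Poisson sample with a single parameter, testing $H_0: \lambda = \lambda_0$ (so $q = k = 1$ and there are no nuisance parameters). This toy case is an assumption chosen for illustration and is not the ZICMP test studied in the paper; the function names and the sample data are hypothetical. For the Poisson model the score is $U^{*} = \sum_i z_i/\lambda_0 - n$ and the expected information is $\Gamma^{*} = n/\lambda_0$.

```python
import math


def poisson_lr_and_score(z, lam0):
    """LR and score statistics for H0: lambda = lam0 in an i.i.d. Poisson sample.

    Assumes the sample contains at least one positive count, so the
    unrestricted MLE is strictly positive.
    """
    n = len(z)
    total = sum(z)
    lam_hat = total / n  # unrestricted MLE under H1

    def loglik(lam):
        # Log-likelihood up to the term -sum(ln z_i!), which cancels in the ratio.
        return total * math.log(lam) - n * lam

    # -2 ln(lambda) = -2 [ln L(theta*_hat) - ln L(theta_hat)]; here theta*_hat = lam0.
    lr = -2.0 * (loglik(lam0) - loglik(lam_hat))

    u = total / lam0 - n   # score U* evaluated under H0
    info = n / lam0        # expected information Gamma* under H0
    score = u * u / info   # S_c = U*^T Gamma*^{-1} U* in the scalar case
    return lr, score


def chi2_sf_1df(x):
    """Upper tail P(X > x) of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))


if __name__ == "__main__":
    sample = [0, 1, 2, 3, 4, 2, 1, 0, 3, 4]  # hypothetical counts
    lr, score = poisson_lr_and_score(sample, lam0=1.5)
    print("LR statistic:", lr, "p-value:", chi2_sf_1df(lr))
    print("Score statistic:", score, "p-value:", chi2_sf_1df(score))
```

The score statistic needs only the fit under $H_0$, whereas the LR statistic needs both fits. The closed-form tail probability used here is valid only for one degree of freedom; for general $k$ a chi-square routine from a statistics library would be needed.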

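(B) Derivatives of the CMP pmf and normalizing constant, $Z(\lambda,\nu)$

Let $Z_i = Z(\lambda_i, \nu)$.

$$\frac{\partial \ln P(k_i > 0)}{\partial \nu} = -\ln(k_i!) - \frac{1}{Z_i}\frac{\partial Z_i}{\partial \nu}, \qquad \frac{\partial \ln P(k_i > 0)}{\partial \beta_r} = k_i X_{ir} - \frac{1}{Z_i}\frac{\partial Z_i}{\partial \beta_r},$$

$$\frac{\partial^2 \ln P(k_i > 0)}{\partial \nu\,\partial \beta_r} = \frac{1}{Z_i^2}\frac{\partial Z_i}{\partial \nu}\frac{\partial Z_i}{\partial \beta_r} - \frac{1}{Z_i}\frac{\partial^2 Z_i}{\partial \nu\,\partial \beta_r}, \qquad \frac{\partial^2 \ln P(k_i > 0)}{\partial \beta_r\,\partial \beta_s} = \frac{1}{Z_i^2}\frac{\partial Z_i}{\partial \beta_s}\frac{\partial Z_i}{\partial \beta_r} - \frac{1}{Z_i}\frac{\partial^2 Z_i}{\partial \beta_s\,\partial \beta_r},$$

$$\frac{\partial^2 \ln P(k_i > 0)}{\partial \nu^2} = \frac{1}{Z_i^2}\left(\frac{\partial Z_i}{\partial \nu}\right)^2 - \frac{1}{Z_i}\frac{\partial^2 Z_i}{\partial \nu^2},$$

$$\frac{\partial Z_i}{\partial \beta_r} = \sum_{j=1}^{\infty} \frac{j\,X_{ir}\,\lambda_i^{j}}{(j!)^{\nu}}, \qquad \frac{\partial Z_i}{\partial \nu} = \sum_{j=1}^{\infty} \frac{-\lambda_i^{j}\,\ln(j!)}{(j!)^{\nu}},$$

$$\frac{\partial^2 Z_i}{\partial \beta_r\,\partial \beta_s} = \sum_{j=1}^{\infty} \frac{j^2\,X_{ir}X_{is}\,\lambda_i^{j}}{(j!)^{\nu}}, \qquad \frac{\partial^2 Z_i}{\partial \nu\,\partial \beta_r} = \sum_{j=2}^{\infty} \frac{-j\,X_{ir}\,\lambda_i^{j}\,\ln(j!)}{(j!)^{\nu}},$$

$$\frac{\partial^2 Z_i}{\partial \nu^2} = \sum_{j=2}^{\infty} \frac{\lambda_i^{j}\,\bigl[\ln(j!)\bigr]^2}{(j!)^{\nu}}.$$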
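The series above can be evaluated numerically by truncation. The following is a rough sketch, not the authors' implementation. It assumes the standard CMP normalizing constant $Z(\lambda,\nu) = \sum_{j \ge 0} \lambda^{j}/(j!)^{\nu}$ and a log link $\ln \lambda_i = X_i^{T}\beta$, which is what the factor $j X_{ir}$ in the $\beta$-derivatives implies. The function names, the truncation point `max_terms`, and the default covariate values are hypothetical choices.

```python
import math


def cmp_z_derivatives(lam, nu, x_r=1.0, x_s=1.0, max_terms=200):
    """Truncated-series values of Z(lam, nu) and its derivatives.

    x_r and x_s stand for the covariate values X_ir and X_is; a log link
    ln(lam) = X^T beta is assumed. Terms are formed in log space so that
    large factorials do not overflow.
    """
    log_lam = math.log(lam)
    out = {
        "Z": 0.0,
        "dZ_dbeta_r": 0.0,
        "dZ_dnu": 0.0,
        "d2Z_dbeta_r_dbeta_s": 0.0,
        "d2Z_dnu_dbeta_r": 0.0,
        "d2Z_dnu2": 0.0,
    }
    for j in range(max_terms):
        log_fact = math.lgamma(j + 1)                 # ln(j!)
        term = math.exp(j * log_lam - nu * log_fact)  # lam^j / (j!)^nu
        out["Z"] += term
        out["dZ_dbeta_r"] += j * x_r * term
        out["dZ_dnu"] += -log_fact * term
        out["d2Z_dbeta_r_dbeta_s"] += j * j * x_r * x_s * term
        out["d2Z_dnu_dbeta_r"] += -j * x_r * log_fact * term
        out["d2Z_dnu2"] += log_fact ** 2 * term
    return out


def cmp_log_pmf_gradient(k, lam, nu, x_r=1.0):
    """First derivatives of the CMP log-pmf at count k with respect to nu and beta_r."""
    d = cmp_z_derivatives(lam, nu, x_r=x_r)
    d_nu = -math.lgamma(k + 1) - d["dZ_dnu"] / d["Z"]
    d_beta_r = k * x_r - d["dZ_dbeta_r"] / d["Z"]
    return d_nu, d_beta_r


if __name__ == "__main__":
    # With nu = 1 the CMP reduces to the Poisson, so Z should equal exp(lam).
    d = cmp_z_derivatives(2.0, 1.0)
    print("Z:", d["Z"], "exp(lam):", math.exp(2.0))
    print("gradient at k = 3:", cmp_log_pmf_gradient(3, 2.0, 1.0))
```

A fixed truncation point is adequate for moderate $\lambda$ and $\nu$ not too close to zero; for small $\nu$ the series converges slowly (and for $\nu = 0$ it converges only when $\lambda < 1$), so an adaptive stopping rule would be a safer choice in practice.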


Received: 2016-08-21

Revised: 2017-11-28

Accepted: 2017-11-29

Published Online: 2018-01-09
