Home Zero-inflated Conway-Maxwell Poisson Distribution to Analyze Discrete Data
Article
Licensed
Unlicensed Requires Authentication

Zero-inflated Conway-Maxwell Poisson Distribution to Analyze Discrete Data

  • Shin Zhu Sim , Ramesh C. Gupta and Seng Huat Ong EMAIL logo
Published/Copyright: January 9, 2018

Abstract

In this paper, we study the zero-inflated Conway-Maxwell Poisson (ZICMP) distribution and develop a regression model. Score and likelihood ratio tests are also implemented for testing the inflation/deflation parameter. Simulation studies are carried out to examine the performance of these tests. A data example is presented to illustrate the concepts. In this example, the proposed model is compared to the well-known zero-inflated Poisson (ZIP) and the zero- inflated generalized Poisson (ZIGP) regression models. It is shown that the fit by ZICMP is comparable or better than these models.

Funding statement: This work was supported by Fundamental Research Grant Scheme, Ministry of Higher Education, Malaysia [FP045-2015A] and University of Malaya’s Research Grant Scheme [RP009A-13AFR].

Acknowledgements

We wish to thank the referees for their insightful comments which have vastly improved the paper.

Appendix

(A) Score and Likelihood Ratio Tests

Let Z1,Z2,,Zn be n rv’s with pmf pz;θ, parameters θ=θ1,θ2,,θq and Lθ;z be the likelihood function.

The hypothesis of interest

H 0 : θ 1 = θ 1 0 , θ 2 = θ 2 0 , , θ k = θ k 0 ; θ k + 1 , θ k + 2 , , θ q u n s p e c i f i e d

against the alternative

H 1 : θ = θ 1 , θ 2 , , θ q u n s p e c i f i e d

can be tested by using the score and likelihood ratio (LR) tests, where θˆ and θˆ are the maximum likelihood estimates under H0 and H1 respectively. The test statistics of these tests are summarized below.

Likelihood Ratio Test

2 ln λ = 2 ln L θ ˆ ; z / L θ ˆ ; z

Score Test

S c = U T Γ 1 U

where UT=u1θ,u2θ,,uqθθ=θˆ, uiθ=lnLθ;zθi,i=1,2,,q

and Γ=Γij=E2lnLθ;zθiθjθ=θˆ,i,j=1,2,,q

These test statistics are each asymptotically χ2 distributed with k degrees of freedom.

(B) Derivatives of the CMP pmf and normalizing constant, Zλ,ν

L e t Z i = Z ( λ i , ν )

ln P ( k i  \gt  0 ) ν = ln ( k i ! ) 1 Z i Z i ν , ln P ( k i  \gt  0 ) β r = k i X i r 1 Z i Z i β r ,

2 ln P ( k i \gt 0 ) ν β r = Z i ν Z i β r Z i 2 2 Z i ν β r Z i , 2 ln P ( k i  \gt  0 ) β r β s = Z i β s Z i β r Z i 2 2 Z i β s β r Z i ,

2 ln P ( k i > 0 ) ν 2 = Z i ν 2 Z i 2 2 Z i ν 2 Z i

Z i β r = j = 1 j X i r λ i j ( j ! ) ν , Z i ν = j = 1 λ i j ln ( j ! ) ( j ! ) ν ,

2 Z i β r β s = j = 1 j 2 X i r X i s λ i j ( j ! ) ν , 2 Z i ν β r = j = 2 j X i r λ i j ln ( j ! ) ( j ! ) ν ,

2 Z i ν 2 = j = 2 λ i j ln ( j ! ) 2 ( j ! ) ν

References

[1] Gupta PL, Gupta RC, Tripathi RC. Inflated modified power series distributions with applications. Commun Stat Theory Methods. 1995;24:2355–74.10.1080/03610929508831621Search in Google Scholar

[2] Gupta PL, Gupta RC, Tripathi RC. Analysis of zero adjusted count data. Comput Stat Data Anal. 1996;23:207–18.10.1016/S0167-9473(96)00032-1Search in Google Scholar

[3] Gupta PL, Gupta RC, Tripathi RC. Score test for zero inflated generalized Poisson regression model. Commun Stat Theory Methods. 2005;33(1):47–64.10.1081/STA-120026576Search in Google Scholar

[4] Lambert D. Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics. 1992;34:1–14.10.2307/1269547Search in Google Scholar

[5] Ridout M, Hinde J, Demetrio CG. A score test for testing zero inflated Poisson regression model against zero inflated negative binomial alternatives. Biometrics. 2001;57:219–23.10.1111/j.0006-341X.2001.00219.xSearch in Google Scholar PubMed

[6] Yang Z, Hardin JW, Addy CL. Testing overdispersion in the zero-inflated Poisson model. J Stat Plan Inference. 2009;139(9):3340–53.10.1016/j.jspi.2009.03.016Search in Google Scholar

[7] Yang Z, Hardin JW, Addy CL. Score test for overdispersion in zero-inflated Poisson mixed models. Comput Stat Data Anal. 2010a;54(5):1234–46.10.1016/j.csda.2009.11.010Search in Google Scholar

[8] Yang Z, Hardin JW, Addy CL. Score test for zero inflation in overdispersed count data Commun. Stat Theory Methods. 2010b;39(11):2008–30.10.1080/03610920902948228Search in Google Scholar

[9] Hall DB. Zero-inflated Poisson and binomial regression with random effects: A case study. Biometrics. 2000;56:1030–39.10.1111/j.0006-341X.2000.01030.xSearch in Google Scholar

[10] Kelvin KW, Yan KW, Andy HL. Zero inflated negative binomial mixed regression modeling of overdispersed count data with extra zeros. Biometrical J. 2003;45(4):437–52.10.1002/bimj.200390024Search in Google Scholar

[11] Famoye F, Singh KP. Zero-inflated generalized Poisson model with an application to domestic violence data. J Data Sci. 2006;4(1):117–30.10.6339/JDS.2006.04(1).257Search in Google Scholar

[12] Xie FC, Wei BC, Lin JG. Score tests for zero-inflated generalized Poisson mixed regression models. Comput Stat Data Anal. 2009;53:3478–89.10.1016/j.csda.2009.02.017Search in Google Scholar

[13] Consul PC. Generalized Poisson distributions. New York: Marcel Dekker, 1989.Search in Google Scholar

[14] Guikema S, Goffelt J. A flexible count data regression model for risk analysis. Risk Anal. 2008;28(1):213–23.10.1111/j.1539-6924.2008.01014.xSearch in Google Scholar PubMed

[15] Wang W, Famoye F. Modeling household fertility decisions with generalized Poisson regression models. J Popul Econ. 1997;10(3):273–83.10.1007/s001480050043Search in Google Scholar PubMed

[16] Aldieri L, Vinci CP. Education and fertility: an investigation into Italian families. Int J Soc Econ. 2012;39(4):254–63.10.1108/03068291211205686Search in Google Scholar

[17] Ratna MB, Khan HA, Hossain MA. Modeling the number of children ever born in a household in Bangladesh using generalized Poisson regression. ULAB J Sci Eng. 2012;3(1):51–55.Search in Google Scholar

[18] Tin A. Modelling zero-inflated count data with underdispersion and overdispersion. SAS Global Forum Proceedings, Statistics and Data Analysis 2008; (Paper 372-2008).Search in Google Scholar

[19] Sellers KF, Shmueli G. A flexible regression model for count data. Ann Appl Stat. 2010;4(2):943–61.10.1214/09-AOAS306Search in Google Scholar

[20] Conway RW, Maxwell WL. A queuing model with state dependent service rates. J Ind Eng. 1962;12:132–36.Search in Google Scholar

[21] Shmueli G, Minka TP, Kadane JB, Borle S, Boatwright S. A useful distribution for fitting discrete data-revival of the Conway-Maxwell-Poisson distribution. Appl Stat. 2005;54(1):127–42.10.1111/j.1467-9876.2005.00474.xSearch in Google Scholar

[22] Gupta RC, Sim SZ, Ong SH. Analysis of discrete data by Conway-Maxwell Poisson distribution. Adv Stat Anal. 2014;98(4):3247–343.10.1007/s10182-014-0226-4Search in Google Scholar

[23] Kadane JB. Krishnan R and Shmeuli G. A data disclosure policy for count data based on the COM-Poisson distribution. Manag Sci. 2006a;52(10):1610–17.10.1287/mnsc.1060.0562Search in Google Scholar

[24] Rodrigues J, Castro M, Cancho VG, Balakrishnan N. COM-Poisson cure rate survival models and an application to cutaneous melanoma data. J Stat Plan Inference. 2009;139:3605–11.10.1016/j.jspi.2009.04.014Search in Google Scholar

[25] Cancho VG, Castro M, Rodrigues JA. Bayesian analysis of the Conway-Maxwell Poisson cure rate model. Stat Pap. 2012;53(1):165–76.10.1007/s00362-010-0326-5Search in Google Scholar

[26] Kadane JB, Shmeuli G, Minka TP, Borle S, Boatwright P. Conjugate analysis of the Conway-Maxwell Poisson distribution. Bayesian Anal. 2006b;2(2):363–74.Search in Google Scholar

[27] Barriga GD, Louzada F. The zero-inflated Conway–Maxwell–Poisson distribution: Bayesian inference, regression modeling and influence diagnostic. Stat Methodol. 2014;21:23–34.10.1016/j.stamet.2013.11.003Search in Google Scholar

[28] Van den Broek J. A score test for zero inflation in a Poisson distribution. Biometrics. 1995;51(2):738–43.10.2307/2532959Search in Google Scholar

[29] Verbeke G, Molenberghs G. What can go wrong with the score test?. Am Stat. 2007;61(4):289–90.10.1198/000313007X243089Search in Google Scholar

[30] Andrews DW. Estimation when a parameter is on a boundary: part II. [Unpublished manuscript]. Yale University, 1997.Search in Google Scholar

[31] Andrews DW. Estimation when a parameter is on a boundary. Econometrica. 1999;67(6):1341–84.10.1111/1468-0262.00082Search in Google Scholar

[32] Andrews DW. Testing when a parameter is on a boundary of the maintained hypothesis. Econometrica. 2001;69(3):683–734.10.1111/1468-0262.00210Search in Google Scholar

[33] Self SG, Liang KY. Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J Am Stat Assoc. 1987 Jun.;82(398):605–10.10.1080/01621459.1987.10478472Search in Google Scholar

[34] Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. Equation of state calculations by fast computing machines. J Chem Phys. 1953;21:1087–92.10.2172/4390578Search in Google Scholar

[35] Sim SZ, Ong SH. A generalized inverse trinomial distribution with application. Stat Methodol. 2016;33:217–33.10.1016/j.stamet.2016.10.001Search in Google Scholar

Received: 2016-08-21
Revised: 2017-11-28
Accepted: 2017-11-29
Published Online: 2018-01-09

© 2018 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 18.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/ijb-2016-0070/html
Scroll to top button