The analysis of count data: a gentle introduction to Poisson regression and its alternatives

J Pers Assess. 2009 Mar;91(2):121-36. doi: 10.1080/00223890802634175.

Abstract

Count data reflect the number of occurrences of a behavior in a fixed period of time (e.g., number of aggressive acts by children during a playground period). In cases in which the outcome variable is a count with a low arithmetic mean (typically < 10), standard ordinary least squares regression may produce biased results. We provide an introduction to regression models that provide appropriate analyses for count data. We introduce standard Poisson regression with an example and discuss its interpretation. Two variants of Poisson regression, overdispersed Poisson regression and negative binomial regression, are introduced that may provide better results when a key assumption of standard Poisson regression is violated. We also discuss the problem of excess zeros, in which a subgroup of respondents who would never display the behavior is included in the sample, and the problem of truncated zeros, in which respondents who have a zero count are excluded by the sampling plan. We provide computer syntax for our illustrations in SAS and SPSS. The Poisson family of regression models provides improved and now easy-to-implement analyses of count data. [Supplementary materials are available for this article. Go to the publisher's online edition of Journal of Personality Assessment for the following free supplemental resources: the data set used to illustrate Poisson regression in this article, which is available in three formats: a text file, an SPSS database, or a SAS database.]
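As a rough illustration of the kind of model the abstract describes, the following Python sketch fits a one-predictor Poisson regression, log(mu) = b0 + b1*x, by Newton-Raphson and then computes the Pearson dispersion statistic, which is one common way to check whether the variance exceeds the mean (overdispersion). It is not the article's SAS or SPSS syntax, and it does not use the article's data set: the data values, the function names `fit_poisson` and `pearson_dispersion`, and the choice of a single predictor are all assumptions made for this example.

```python
import math

# Hypothetical data (NOT from the article): x is a predictor score,
# y is a count outcome such as the number of aggressive acts observed.
x = [0, 0, 0, 1, 1, 1, 2, 2, 2, 3, 3, 3]
y = [0, 1, 1, 1, 2, 3, 2, 4, 5, 6, 7, 9]


def fit_poisson(x, y, iters=50, tol=1e-10):
    """Fit log(mu) = b0 + b1*x by Newton-Raphson and return (b0, b1)."""
    b0 = math.log(sum(y) / len(y))  # start from the intercept-only fit
    b1 = 0.0
    for _ in range(iters):
        mu = [math.exp(b0 + b1 * xi) for xi in x]
        # Score vector: gradient of the Poisson log-likelihood.
        g0 = sum(yi - mi for yi, mi in zip(y, mu))
        g1 = sum((yi - mi) * xi for xi, yi, mi in zip(x, y, mu))
        # Information matrix (negative Hessian) for the log link.
        h00 = sum(mu)
        h01 = sum(mi * xi for xi, mi in zip(x, mu))
        h11 = sum(mi * xi * xi for xi, mi in zip(x, mu))
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system H * d = g in closed form.
        d0 = (h11 * g0 - h01 * g1) / det
        d1 = (h00 * g1 - h01 * g0) / det
        b0 += d0
        b1 += d1
        if abs(d0) + abs(d1) < tol:
            break
    return b0, b1


def pearson_dispersion(x, y, b0, b1):
    """Pearson chi-square divided by residual degrees of freedom (n - 2)."""
    mu = [math.exp(b0 + b1 * xi) for xi in x]
    chi2 = sum((yi - mi) ** 2 / mi for yi, mi in zip(y, mu))
    return chi2 / (len(y) - 2)


b0, b1 = fit_poisson(x, y)
rate_ratio = math.exp(b1)  # multiplicative change in expected count per unit x
dispersion = pearson_dispersion(x, y, b0, b1)

print(f"b0 = {b0:.3f}, b1 = {b1:.3f}")
print(f"rate ratio per unit x = {rate_ratio:.3f}")
print(f"Pearson dispersion = {dispersion:.3f}")
```

Because the model is linear on the log scale, exp(b1) is read as a multiplicative change in the expected count for each one-unit increase in x. A dispersion value near 1 is consistent with the Poisson assumption that the conditional variance equals the mean; values well above 1 suggest overdispersion, the situation in which an overdispersed Poisson or negative binomial model may be preferable.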

MeSH terms

  • Aggression* / psychology
  • Child
  • Child Behavior* / psychology
  • Data Interpretation, Statistical
  • Humans
  • Models, Statistical*
  • Poisson Distribution*
  • Psychometrics
  • Regression Analysis