language-icon Old Web
English
Sign In

Poisson regression

In statistics, Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables. Poisson regression assumes the response variable Y has a Poisson distribution, and assumes the logarithm of its expected value can be modeled by a linear combination of unknown parameters. A Poisson regression model is sometimes known as a log-linear model, especially when used to model contingency tables. In statistics, Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables. Poisson regression assumes the response variable Y has a Poisson distribution, and assumes the logarithm of its expected value can be modeled by a linear combination of unknown parameters. A Poisson regression model is sometimes known as a log-linear model, especially when used to model contingency tables. Negative binomial regression is a popular generalization of Poisson regression because it loosens the highly restrictive assumption that the variance is equal to the mean made by the Poisson model. The traditional negative binomial regression model, commonly known as NB2, is based on the Poisson-gamma mixture distribution. This model is popular because it models the Poisson heterogeneity with a gamma distribution. Poisson regression models are generalized linear models with the logarithm as the (canonical) link function, and the Poisson distribution function as the assumed probability distribution of the response. If x ∈ R n {displaystyle mathbf {x} in mathbb {R} ^{n}} is a vector of independent variables, then the model takes the form where α ∈ R {displaystyle alpha in mathbb {R} } and β ∈ R n {displaystyle mathbf {eta } in mathbb {R} ^{n}} . Sometimes this is written more compactly as where x is now an (n + 1)-dimensional vector consisting of n independent variables concatenated to a vector of ones. Here θ is simply α concatenated to β. Thus, when given a Poisson regression model θ and an input vector x, the predicted mean of the associated Poisson distribution is given by If Yi are independent observations with corresponding values xi of the predictor variables, then θ can be estimated by maximum likelihood. The maximum-likelihood estimates lack a closed-form expression and must be found by numerical methods. The probability surface for maximum-likelihood Poisson regression is always concave, making Newton–Raphson or other gradient-based methods appropriate estimation techniques. Given a set of parameters θ and an input vector x, the mean of the predicted Poisson distribution, as stated above, is given by

[ "Diabetes mellitus", "Population", "Index of dispersion", "Zero-inflated model", "Compound Poisson process", "Overdispersion", "Compound Poisson distribution" ]
Parent Topic
Child Topic
    No Parent Topic