Poisson regression
Poisson regression is a form of statistical modeling used to model count data and contingency tables. It assumes that the dependent variable, Y, follows a Poisson distribution and the logarithm of its expected value can be modeled by a linear combination of unknown parameters. It is a type of generalized linear model (GLM).
Overview[edit | edit source]
Poisson regression is used when the data being analyzed are counts, which are non-negative integers (0, 1, 2, ...). The counts typically represent the number of times an event occurred in a fixed interval of time or space. The model provides a framework for understanding how the expected counts change in response to changes in the explanatory variables.
Mathematical Formulation[edit | edit source]
The basic form of Poisson regression is:
\[ \log(\mathbb{E}(Y|X)) = \beta_0 + \beta_1X_1 + \cdots + \beta_pX_p \]
where:
- \( Y \) is the count of events,
- \( X_1, \ldots, X_p \) are the explanatory variables,
- \( \beta_0, \beta_1, \ldots, \beta_p \) are the parameters to be estimated,
- \( \mathbb{E}(Y|X) \) is the expected value of \( Y \) given \( X \).
Assumptions[edit | edit source]
The key assumptions of Poisson regression include:
- The mean and variance of the dependent variable (Y) are equal.
- Observations are independent of each other.
- The logarithm of the expected value of the response variable can be modeled by a linear combination of input variables.
Applications[edit | edit source]
Poisson regression is widely used in various fields such as:
- Epidemiology for analyzing the rates of diseases,
- Insurance for claim count modeling,
- Transportation for traffic flow analysis,
- Sociology for crime or accident counts.
Model Fitting[edit | edit source]
The parameters of a Poisson regression model are typically estimated using maximum likelihood estimation (MLE). Software packages like R, SAS, and SPSS provide functions to fit Poisson regression models to data.
Extensions[edit | edit source]
Several extensions of Poisson regression exist to handle overdispersion and excess zeros, which are common issues in count data. These include:
- Negative binomial regression,
- Zero-inflated Poisson models,
- Hurdle models.
See Also[edit | edit source]
Search WikiMD
Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD
WikiMD's Wellness Encyclopedia |
Let Food Be Thy Medicine Medicine Thy Food - Hippocrates |
Translate this page: - East Asian
中文,
日本,
한국어,
South Asian
हिन्दी,
தமிழ்,
తెలుగు,
Urdu,
ಕನ್ನಡ,
Southeast Asian
Indonesian,
Vietnamese,
Thai,
မြန်မာဘာသာ,
বাংলা
European
español,
Deutsch,
français,
Greek,
português do Brasil,
polski,
română,
русский,
Nederlands,
norsk,
svenska,
suomi,
Italian
Middle Eastern & African
عربى,
Turkish,
Persian,
Hebrew,
Afrikaans,
isiZulu,
Kiswahili,
Other
Bulgarian,
Hungarian,
Czech,
Swedish,
മലയാളം,
मराठी,
ਪੰਜਾਬੀ,
ગુજરાતી,
Portuguese,
Ukrainian
WikiMD is not a substitute for professional medical advice. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.
Contributors: Prab R. Tumpati, MD