Statsmodels: statistical modeling and econometrics in Python

License: Other

Language: Python

Keywords: data-analysis, econometrics, generalized-linear-models, python, regression-models, statistics, timeseries-analysis


About statsmodels

statsmodels is a Python package that provides a complement to scipy for statistical computations including descriptive statistics and estimation and inference for statistical models.


Documentation

The documentation for the latest release is at

The documentation for the development version is at

Recent improvements are highlighted in the release notes

Backups of documentation are available at and

Main Features

  • Linear regression models:
    • Ordinary least squares
    • Generalized least squares
    • Weighted least squares
    • Least squares with autoregressive errors
    • Quantile regression
    • Recursive least squares
  • Mixed Linear Model with mixed effects and variance components
  • GLM: Generalized linear models with support for all of the one-parameter exponential family distributions
  • Bayesian Mixed GLM for Binomial and Poisson
  • GEE: Generalized Estimating Equations for one-way clustered or longitudinal data
  • Discrete models:
    • Logit and Probit
    • Multinomial logit (MNLogit)
    • Poisson and Generalized Poisson regression
    • Negative Binomial regression
    • Zero-Inflated Count models
  • RLM: Robust linear models with support for several M-estimators.
  • Time Series Analysis: models for time series analysis
    • Complete StateSpace modeling framework
      • Seasonal ARIMA and ARIMAX models
      • VARMA and VARMAX models
      • Dynamic Factor models
      • Unobserved Component models
    • Markov switching models (MSAR), also known as Hidden Markov Models (HMM)
    • Univariate time series analysis: AR, ARIMA
    • Vector autoregressive models, VAR and structural VAR
    • Vector error correction model, VECM
    • Exponential smoothing, Holt-Winters
    • Hypothesis tests for time series: unit root, cointegration and others
    • Descriptive statistics and process models for time series analysis
  • Survival analysis:
    • Proportional hazards regression (Cox models)
    • Survivor function estimation (Kaplan-Meier)
    • Cumulative incidence function estimation
  • Multivariate:
    • Principal Component Analysis with missing data
    • Factor Analysis with rotation
    • MANOVA
    • Canonical Correlation
  • Nonparametric statistics: Univariate and multivariate kernel density estimators
  • Datasets: Datasets used for examples and in testing
  • Statistics: a wide range of statistical tests
    • diagnostics and specification tests
    • goodness-of-fit and normality tests
    • functions for multiple testing
    • various additional statistical tests
  • Imputation with MICE, regression on order statistic and Gaussian imputation
  • Mediation analysis
  • Graphics includes plot functions for visual analysis of data and model results
  • I/O
    • Tools for reading Stata .dta files (pandas provides a more recent reader)
    • Table output to ascii, latex, and html
  • Miscellaneous models
  • Sandbox: statsmodels contains a sandbox folder with code in various stages of development and testing that is not considered "production ready". It includes, among others:
    • Generalized method of moments (GMM) estimators
    • Kernel regression
    • Various extensions to scipy.stats.distributions
    • Panel data models
    • Information theoretic measures

How to get it

The master branch on GitHub has the most up-to-date code

Source downloads of release tags are available on GitHub

Binaries and source distributions are available from PyPI

Binaries can be installed in Anaconda

conda install statsmodels
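After installing, one way to confirm that the package is visible to the current Python environment is to query its installed version. The sketch below uses only the standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is hypothetical and not part of statsmodels.

```python
from importlib import metadata


def installed_version(package="statsmodels"):
    """Return the installed version string of `package`, or None if no
    installed distribution with that name is found."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("statsmodels is not installed in this environment")
    else:
        print(f"statsmodels version: {version}")
```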

Installing from sources

See INSTALL.txt for requirements or see the documentation


Contributing

Contributions in any form are welcome, including:

  • Documentation improvements
  • Additional tests
  • New features to existing models
  • New models

See the documentation for instructions on installing statsmodels in editable mode.


License

Modified BSD (3-clause)

Discussion and Development

Discussions take place on the mailing list and in the issue tracker. We are very interested in feedback about usability and suggestions for improvements.

Bug Reports

Bug reports can be submitted to the issue tracker at

Project Statistics

Sourcerank: 15
Repository Size: 36.2 MB
Stars: 4,740
Forks: 1,848
Watchers: 258
Open issues: 1,907
Dependencies: 22
Contributors: 218
Tags: 35

Top Contributors

Josef Perktold, Skipper Seabold, ChadFulton, Kevin Sheppard, Kerby Shedden, jbrockmendel, j-grana6, Peter Quackenbush, Vincent Arel-Bundock, Wes McKinney, Ian Langmore, Bart Baker, Ralf Gommers, yogabonito, Evgeny Zhurko, Matthew Brett, Enrico Giampieri, Tom Augspurger, Yichuan Liu, Paul Hobson

Packages Referencing this Repo

Statistical computations and models for Python
Latest release 0.11.1 - Updated - 4.74K stars
Statsmodels is a Python module that allows users to explore data, estimate statistical models, an...
Latest release 0.11.0 - Updated - 4.74K stars

Recent Tags

v0.11.1 February 21, 2020
v0.12.0.dev0 January 22, 2020
v0.11.0 January 22, 2020
v0.11.0rc2 January 15, 2020
v0.11.0rc1 December 18, 2019
v0.10.2 November 23, 2019
v0.10.1 July 19, 2019
v0.11.0dev0 July 15, 2019
v0.10.0 June 24, 2019
v0.10.0rc2 June 07, 2019
v0.11.0.dev0 September 21, 2018
v0.10.0rc1 September 21, 2018
v0.10.0.dev0 September 12, 2018
v0.9.0 May 14, 2018
0.9.0rc1 April 30, 2018

Interesting Forks

main repo of statsmodels
Python - BSD-3-Clause - Last pushed - 16 stars - 15 forks
Statsmodels: statistical modeling and econometrics in Python
Python - Other - Last pushed - 7 stars - 12 forks
main repo of statsmodels
Python - BSD-3-Clause - Last pushed - 6 stars - 2 forks
Statsmodels: statistical modeling and econometrics in Python
Python - Other - Last pushed - 4 stars - 4 forks
Statsmodels: statistical modeling and econometrics in Python
Python - BSD-3-Clause - Last pushed - 3 stars - 1 forks

Last synced: 2020-02-21 13:18:05 UTC
