Statistical software comprises specialized computer programs for analysis in statistics and econometrics.
Open-source
*
ADaMSoft – a generalized statistical software with
data mining algorithms and methods for data management
*
ADMB – a software suite for non-linear statistical modeling based on
C++ which uses
automatic differentiation
*
Chronux
– for neurobiological time series data
*
DAP – free replacement for SAS
*
Environment for DeveLoping KDD-Applications Supported by Index-Structures (ELKI) – a software framework for developing data mining algorithms in Java
*
Epi Info –
statistical software for
epidemiology developed by
Centers for Disease Control and Prevention
(CDC). Apache 2 licensed
*
Fityk – nonlinear regression software (GUI and command line)
*
GNU Octave – programming language very similar to MATLAB with statistical features
*
gretl – GNU Regression, Econometrics and Time-series Library
*
intrinsic Noise Analyzer (iNA) – For analyzing intrinsic fluctuations in biochemical systems
*
jamovi – A free software alternative to IBM
SPSS Statistics
*
JASP – A free software alternative to IBM
SPSS Statistics with additional option for Bayesian methods
*
JMulTi – For
econometric analysis, specialised in univariate and multivariate
time series
analysis
*
Just another Gibbs sampler (JAGS) – a program for analyzing Bayesian hierarchical models using
Markov chain Monte Carlo developed by Martyn Plummer. It is similar to WinBUGS
*
KNIME – An open source analytics platform built with
Java
and
Eclipse using modular data pipeline workflows
*
LIBSVM – C++ support vector machine libraries
*
mlpack – open-source library for machine learning, exploits C++ language features to provide maximum performance and flexibility while providing a simple and consistent
application programming interface (API)
*
Mondrian
– data analysis tool using interactive statistical graphics with a link to
R
*
Neurophysiological Biomarker Toolbox – Matlab toolbox for data-mining of neurophysiological biomarkers
*
OpenBUGS
*
OpenEpi – A web-based, open-source, operating-system-independent series of programs for use in epidemiology and statistics based on JavaScript and HTML
*
OpenMx – A package for
structural equation modeling running in
R (programming language)
*
OpenNN
– A
software library written in the programming language
C++ which implements
neural networks, a main area of
deep learning research
*
Orange – data mining, machine learning, and bioinformatics software
*
Pandas – high-performance data structures and data analysis tools for Python, written in Python and Cython (statsmodels, scikit-learn)
*
Perl Data Language – Scientific computing with Perl
*
Ploticus – software for generating a variety of graphs from raw data
*
PSPP – A free software alternative to IBM
SPSS Statistics
*
R –
free
implementation of the
S (programming language)
**
Programming with Big Data in R (pbdR) – a series of R packages enhanced by
SPMD parallelism for
big data analysis
**
R Commander – GUI for R
**
Rattle GUI – GUI for R
**
Revolution Analytics – production-grade software for enterprise big data analytics
**
RStudio – GUI and development environment for R
*
ROOT – an open-source C++ system for data storage, processing and analysis, developed by CERN and used to find the
Higgs boson
*
Salstat – menu-driven statistics software
*
Scilab – uses GPL-compatible
CeCILL license
*
SciPy –
Python library for scientific computing that contains the ''stats'' sub-package, which is partly based on the venerable ''|STAT'' (a.k.a. ''PipeStat'', formerly ''UNIX|STAT'') software
**
scikit-learn – extends SciPy with a host of machine learning models (classification, clustering, regression, etc.)
**
statsmodels – extends SciPy with statistical models and tests (regression, plotting, example datasets,
generalized linear model (GLM),
time series analysis,
autoregressive–moving-average model (ARMA),
vector autoregression (VAR),
non-parametric statistics
,
ANOVA,
empirical likelihood)
*
Shogun (toolbox) –
open-source, large-scale machine learning toolbox that provides several SVM (Support Vector Machine) implementations (like libSVM, SVMlight) under a common framework and interfaces to Octave, MATLAB, Python, R
*
Simfit – simulation, curve fitting, statistics, and plotting
*
SOCR
*
SOFA Statistics – desktop GUI program focused on ease of use, learn as you go, and beautiful output
*
Stan (software) – open-source package for obtaining Bayesian inference using the No-U-Turn sampler, a variant of
Hamiltonian Monte Carlo. It is somewhat like BUGS, but with a different language for expressing models and a different sampler for sampling from their posteriors
*
Statistical Lab
–
R-based and focusing on educational purposes
*
TOPCAT (software) – interactive graphical analysis and manipulation package for astronomers that understands
FITS, VOTable and
CDF formats.
*
Torch (machine learning) – a
deep learning software library written in
Lua (programming language)
*
Weka (machine learning) – a suite of machine learning software written at the
University of Waikato
Public domain
*
CSPro (core is public domain but without publicly available source code; the web UI has been open sourced under Apache version 2 and the help system under GPL version 3)
*
Dataplot
(NIST)
*
X-13ARIMA-SEATS (public domain in the United States only; outside of the United States is under US government copyright)
Freeware
*
BV4.1
*
GeoDA
*
MaxStat Lite – general statistical software
*
MINUIT
*
WinBUGS –
Bayesian analysis using
Markov chain Monte Carlo methods
*
Winpepi – package of statistical programs for epidemiologists
Proprietary
*
Alteryx – analytics platform with drag and drop statistical models; R and Python integration
*
Analytica – visual analytics and statistics package
*
Angoss – products KnowledgeSEEKER and KnowledgeSTUDIO incorporate several data mining algorithms
*
ASReml – for restricted maximum likelihood analyses
*
BMDP
– general statistics package
*
DataGraph – visual analysis with linear and nonlinear regression
*
DB Lytix – 800+ in-database models
*
EViews – for
econometric analysis
*
FAME (database) – a system for managing
time-series databases
*
GAUSS –
programming language
for statistics
*
Genedata
– software for integration and interpretation of experimental data in life science R&D
*
GenStat – general statistics package
*
GLIM
– early package for fitting
generalized linear models
*
GraphPad InStat
– very simple with much guidance and explanations
*
GraphPad Prism – biostatistics and nonlinear regression with clear explanations
*
IMSL Numerical Libraries – software library with statistical algorithms
*
JMP – visual analysis and statistics package
*
LIMDEP
– comprehensive statistics and econometrics package
*
LISREL – statistics package used in structural equation modeling
*
Maple
–
programming language
with statistical features
*
Mathematica
– a software package with statistical features
*
MATLAB
–
programming language
with statistical features
*
MaxStat Pro – general statistical software
*
MedCalc – for biomedical sciences
*
Microfit
– econometrics package, time series
*
Minitab – general statistics package
*
MLwiN – multilevel models (free to UK academics)
*
Nacsport Video Analysis Software – software for analysing sports and obtaining statistical intelligence
*
NAG Numerical Library – comprehensive math and statistics library
*
NCSS – general statistics package
*
Neural Designer
– commercial
deep learning package
*
NLOGIT – comprehensive statistics and econometrics package
*
nQuery Sample Size Software – Sample Size and Power Analysis Software
*
O-Matrix
–
programming language
*
OriginPro – statistics and graphing, programming access to
NAG library
*
PASS Sample Size Software (PASS) – power and sample size software from NCSS
*
Plotly
– plotting library and styling interface for analyzing data and creating browser-based graphs. Available for
R,
Python,
MATLAB
,
Julia, and
Perl
*
Primer-E Primer – specific to environmental and ecological data
*
PV-WAVE – programming language for comprehensive data analysis and visualization, with the IMSL statistical package
*
Qlucore Omics Explorer – interactive and visual data analysis software
*
RapidMiner
–
machine learning
toolbox
*
Regression Analysis of Time Series (RATS) – comprehensive
econometric analysis package
*
S-PLUS – general statistics package
*
SAS (software) – comprehensive statistical package
*
SHAZAM (Econometrics and Statistics Software) – comprehensive econometrics and statistics package
*
SigmaStat – package for group analysis
*
Simul
– econometric tool for multidimensional (multi-sectoral, multi-regional) modeling
*
SmartPLS – statistics package used in
partial least squares path modeling (PLS) and PLS-based
structural equation modeling
*
SOCR – online tools for teaching
statistics and
probability theory
*
Speakeasy (computational environment)
– numerical computational environment and programming language with many statistical and
econometric analysis features
*
SPSS Modeler – comprehensive data mining and text analytics workbench
*
SPSS Statistics – comprehensive statistics package
*
Stata – comprehensive statistics package
*
StatCrunch
– comprehensive statistics package, originally designed for college statistics courses
*
Statgraphics – general statistics package that includes cloud computing and Six Sigma, for use in business development, process improvement, data visualization, and statistical analysis; design of experiments, point processes, geospatial analysis, regression, and time series analysis are all included
*
Statistica
– comprehensive statistics package
*
StatsDirect
– statistics package designed for biomedical, public health and general health science uses
*
StatXact – package for exact nonparametric and parametric statistics
*
SuperCROSS – comprehensive statistics package with ad-hoc, cross tabulation analysis
*
Systat – general statistics package
*
The Unscrambler
– free-to-try commercial
multivariate analysis software for Windows
*
Unistat – general statistics package that can also work as Excel add-in
*
WarpPLS – statistics package used in
structural equation modeling
*
Wolfram Language – the computer language that evolved from the program Mathematica. It has statistical capabilities similar to those of Mathematica.
*
World Programming System (WPS) – statistical package that supports the use of Python, R and SAS languages within a single user program
*
XploRe
Add-ons
*
Analyse-it
– add-on to
Microsoft Excel
for statistical analysis
*
Statgraphics Sigma Express – add-on to
Microsoft Excel
for
Six Sigma statistical analysis
*
SUDAAN
– add-on to
SAS
and
SPSS for
statistical surveys
*
XLfit – add-on to Microsoft Excel for curve fitting and statistical analysis
See also
*
Comparison of statistical packages
*
Free statistical software
*
List of computer algebra systems
*
List of information graphics software
*
List of numerical libraries
*
List of numerical-analysis software
*
Mathematical software
*
Psychometric software