SciPy Statistical Functions Last Updated : 23 Jun, 2025 Summarize Comments Improve Suggest changes Share Like Article Like Report SciPy provides a comprehensive set of statistical functions in its scipy.stats module. These tools are important for performing descriptive statistics, statistical testing, probability distributions and random variable operations in scientific and data analysis workflows.The following table summarizes key statistical functions in SciPyFunctionDescriptionMean() Calculates the average of a dataset by summing all values and dividing by the number of values.Mode() Finds the most frequent value(s) in a dataset.Median() Identifies the middle value in a sorted dataset.standard deviationMeasures the spread of data points around the meanharmonic meanCalculates the reciprocal of the average of the reciprocals, useful for rates.varianceA measure of spread, calculated as the average squared deviation from the mean.standard errorThe standard deviation of a sampling distribution, indicating variability.percentileA value below which a given percentage of observations fallskewness Measures the asymmetry of a distributionlog-logistic distributionModels survival data and is used in reliability analysis.Student’s t DistributionEstimates population parameters when the sample size is small..kurtosis (Fisher or Pearson)Measures the "tailedness" of a distribution.Z-scoreMeasures how many standard deviations a data point is from the mean.coefficient of variationRatio of the standard deviation to the mean, used to measure variability.cumulative frequencyShows the running total of frequencies in a dataset.Continuous Random Variable A random variable that can take any value within a range (e.g., height, weight, temperature).gamma continuous random variableUsed to model waiting times or life durations. exponential continuous random variableModels time between events in a Poisson process.generalized exponential continuous random variableFlexible for modeling data with additional shape parameters.alpha continuous random variableOften related to the shape parameter in various probability distributions.Erlang continuous random variableA special case of the Gamma distribution, used in queuing theory.chi continuous random variableUsed in hypothesis testing, particularly the Chi-squared test.Uniform continuous random variableAll outcomes within a range are equally likely.Half-logistic continuous random variableA variant of the logistic distribution, limited to positive values.arcsine continuous random variableModels the arcsine of a variable, often used for angle-related data.Half-Cauchy continuous random variableSimilar to Cauchy but restricted to positive values, often used in Bayesian statistics.cosine continuous random variableBased on the cosine function, used in circular data applications.Frechet left ( Weibull maximum)Models the maximum values in a dataset, like the largest daily rainfall.Related Article:Statistics in SciPyOptimization in SciPyIntegration in SciPyData Analysis with SciPy Comment More infoAdvertise with us Next Article SciPy Statistical Functions K kareeen0d5l Follow Improve Article Tags : Data Science Library AI-ML-DS With Python Data Science Similar Reads Linear Regression in Machine learning Linear regression is a type of supervised machine-learning algorithm that learns from the labelled datasets and maps the data points with most optimized linear functions which can be used for prediction on new datasets. It assumes that there is a linear relationship between the input and output, mea 15+ min read Logistic Regression in Machine Learning Logistic Regression is a supervised machine learning algorithm used for classification problems. Unlike linear regression which predicts continuous values it predicts the probability that an input belongs to a specific class. It is used for binary classification where the output can be one of two po 11 min read K means Clustering â Introduction K-Means Clustering is an Unsupervised Machine Learning algorithm which groups unlabeled dataset into different clusters. It is used to organize data into groups based on their similarity. Understanding K-means ClusteringFor example online store uses K-Means to group customers based on purchase frequ 4 min read K-Nearest Neighbor(KNN) Algorithm K-Nearest Neighbors (KNN) is a supervised machine learning algorithm generally used for classification but can also be used for regression tasks. It works by finding the "k" closest data points (neighbors) to a given input and makesa predictions based on the majority class (for classification) or th 8 min read Introduction to Convolution Neural Network Convolutional Neural Network (CNN) is an advanced version of artificial neural networks (ANNs), primarily designed to extract features from grid-like matrix datasets. This is particularly useful for visual datasets such as images or videos, where data patterns play a crucial role. CNNs are widely us 8 min read Naive Bayes Classifiers Naive Bayes is a classification algorithm that uses probability to predict which category a data point belongs to, assuming that all features are unrelated. This article will give you an overview as well as more advanced use and implementation of Naive Bayes in machine learning. Illustration behind 7 min read Principal Component Analysis(PCA) PCA (Principal Component Analysis) is a dimensionality reduction technique used in data analysis and machine learning. It helps you to reduce the number of features in a dataset while keeping the most important information. It changes your original features into new features these new features donât 7 min read Introduction to Recurrent Neural Networks Recurrent Neural Networks (RNNs) differ from regular neural networks in how they process information. While standard neural networks pass information in one direction i.e from input to output, RNNs feed information back into the network at each step.Lets understand RNN with a example:Imagine reading 10 min read Decision Tree A Decision Tree helps us to make decisions by mapping out different choices and their possible outcomes. Itâs used in machine learning for tasks like classification and prediction. In this article, weâll see more about Decision Trees, their types and other core concepts.A Decision Tree helps us make 6 min read Random Forest Algorithm in Machine Learning Random Forest is a machine learning algorithm that uses many decision trees to make better predictions. Each tree looks at different random parts of the data and their results are combined by voting for classification or averaging for regression. This helps in improving accuracy and reducing errors. 5 min read Like