SciPy - Home
SciPy - Introduction
SciPy - Environment Setup
SciPy - Basic Functionality
SciPy - Relationship with NumPy
SciPy Clusters
SciPy - Clusters
SciPy - Hierarchical Clustering
SciPy - K-means Clustering
SciPy - Distance Metrics
SciPy Constants
SciPy - Constants
SciPy - Mathematical Constants
SciPy - Physical Constants
SciPy - Unit Conversion
SciPy - Astronomical Constants
SciPy - Fourier Transforms
SciPy - FFTpack
SciPy - Discrete Fourier Transform (DFT)
SciPy - Fast Fourier Transform (FFT)
SciPy Integration Equations
SciPy - Integrate Module
SciPy - Single Integration
SciPy - Double Integration
SciPy - Triple Integration
SciPy - Multiple Integration
SciPy Differential Equations
SciPy - Differential Equations
SciPy - Integration of Stochastic Differential Equations
SciPy - Integration of Ordinary Differential Equations
SciPy - Discontinuous Functions
SciPy - Oscillatory Functions
SciPy - Partial Differential Equations
SciPy Interpolation
SciPy - Interpolate
SciPy - Linear 1-D Interpolation
SciPy - Polynomial 1-D Interpolation
SciPy - Spline 1-D Interpolation
SciPy - Grid Data Multi-Dimensional Interpolation
SciPy - RBF Multi-Dimensional Interpolation
SciPy - Polynomial & Spline Interpolation
SciPy Curve Fitting
SciPy - Curve Fitting
SciPy - Linear Curve Fitting
SciPy - Non-Linear Curve Fitting
SciPy - Input & Output
SciPy - Input & Output
SciPy - Reading & Writing Files
SciPy - Working with Different File Formats
SciPy - Efficient Data Storage with HDF5
SciPy - Data Serialization
SciPy Linear Algebra
SciPy - Linalg
SciPy - Matrix Creation & Basic Operations
SciPy - Matrix LU Decomposition
SciPy - Matrix QU Decomposition
SciPy - Singular Value Decomposition
SciPy - Cholesky Decomposition
SciPy - Solving Linear Systems
SciPy - Eigenvalues & Eigenvectors
SciPy Image Processing
SciPy - Ndimage
SciPy - Reading & Writing Images
SciPy - Image Transformation
SciPy - Filtering & Edge Detection
SciPy - Top Hat Filters
SciPy - Morphological Filters
SciPy - Low Pass Filters
SciPy - High Pass Filters
SciPy - Bilateral Filter
SciPy - Median Filter
SciPy - Non - Linear Filters in Image Processing
SciPy - High Boost Filter
SciPy - Laplacian Filter
SciPy - Morphological Operations
SciPy - Image Segmentation
SciPy - Thresholding in Image Segmentation
SciPy - Region-Based Segmentation
SciPy - Connected Component Labeling
SciPy Optimize
SciPy - Optimize
SciPy - Special Matrices & Functions
SciPy - Unconstrained Optimization
SciPy - Constrained Optimization
SciPy - Matrix Norms
SciPy - Sparse Matrix
SciPy - Frobenius Norm
SciPy - Spectral Norm
SciPy Condition Numbers
SciPy - Condition Numbers
SciPy - Linear Least Squares
SciPy - Non-Linear Least Squares
SciPy - Finding Roots of Scalar Functions
SciPy - Finding Roots of Multivariate Functions
SciPy - Signal Processing
SciPy - Signal Filtering & Smoothing
SciPy - Short-Time Fourier Transform
SciPy - Wavelet Transform
SciPy - Continuous Wavelet Transform
SciPy - Discrete Wavelet Transform
SciPy - Wavelet Packet Transform
SciPy - Multi-Resolution Analysis
SciPy - Stationary Wavelet Transform
SciPy - Statistical Functions
SciPy - Stats
SciPy - Descriptive Statistics
SciPy - Continuous Probability Distributions
SciPy - Discrete Probability Distributions
SciPy - Statistical Tests & Inference
SciPy - Generating Random Samples
SciPy - Kaplan-Meier Estimator Survival Analysis
SciPy - Cox Proportional Hazards Model Survival Analysis
SciPy Spatial Data
SciPy - Spatial
SciPy - Special Functions
SciPy - Special Package
SciPy Advanced Topics
SciPy - CSGraph
SciPy - ODR
SciPy Useful Resources
SciPy - Reference
SciPy - Quick Guide
SciPy - Cheatsheet
SciPy - Useful Resources
SciPy - Discussion

SciPy - Continuous Probability Distributions

Quiz

Continuous probability distributions refer to statistical models where the random variable can take any value within a specified range or interval. These distributions are fundamental in many scientific fields such as physics, engineering and economics, as they can model real-world scenarios like measurements or time intervals.

The scipy.stats library in Python provides an extensive collection of tools for working with these distributions by allowing us to calculate important statistical measures such as probability density functions (PDF), cumulative distribution functions (CDF) and more.

Key Continuous Distributions in SciPy

In SciPy continuous distributions represent random variables that can take any value within a range. SciPy provides a wide variety of continuous probability distributions and methods for working with them.

Normal Distribution

The Normal Distribution which often referred to as the Gaussian distribution, is one of the most commonly used continuous distributions in statistics. It has a symmetric bell-shaped curve, with the center of the distribution defined by its mean and the spread determined by its standard deviation. This distribution is widely applied in various fields like quality control, finance and natural sciences.

In SciPy the normal distribution is represented by the scipy.stats.norm object. Heres an example of calculating and visualizing the probability density and cumulative distribution of a normal distribution −

from scipy.stats import norm
import numpy as np
import matplotlib.pyplot as plt

# Define mean and standard deviation
mean = 0
std_dev = 1

# Generate an array of values for x between -5 and 5
x_values = np.linspace(-5, 5, 100)

# Calculate the probability density function (PDF) and cumulative distribution function (CDF)
pdf_values = norm.pdf(x_values, mean, std_dev)
cdf_values = norm.cdf(x_values, mean, std_dev)

# Plot the results
plt.figure(figsize=(12, 6))

# PDF plot
plt.subplot(1, 2, 1)
plt.plot(x_values, pdf_values, label='PDF')
plt.title('Normal Distribution - PDF')
plt.legend()

# CDF plot
plt.subplot(1, 2, 2)
plt.plot(x_values, cdf_values, label='CDF', color='red')
plt.title('Normal Distribution - CDF')
plt.legend()

plt.tight_layout()
plt.show()

Here is the output of the normal distribution calculated using scipy.stats.norm.pdf() and scipy.stats.norm.cdf() function −

Exponential Distribution

The Exponential Distribution is often used to model the time between events in a Poisson process, where the events occur independently and at a constant average rate. The distribution has a single parameter, (lambda) which represents the rate at which events happen. This distribution is useful for processes that involve waiting times.

In SciPy the exponential distribution can be handled with the scipy.stats.expon object. Heres an example of calculating and plotting the PDF and CDF for the exponential distribution −

from scipy.stats import expon
import numpy as np
import matplotlib.pyplot as plt

# Set the rate (lambda)
rate = 1

# Create an array of x values from 0 to 10
x_values = np.linspace(0, 10, 100)

# Compute the PDF and CDF
pdf_values = expon.pdf(x_values, scale=1/rate)
cdf_values = expon.cdf(x_values, scale=1/rate)

# Plot the distributions
plt.figure(figsize=(12, 6))

# PDF plot
plt.subplot(1, 2, 1)
plt.plot(x_values, pdf_values, label='PDF')
plt.title('Exponential Distribution - PDF')
plt.legend()

# CDF plot
plt.subplot(1, 2, 2)
plt.plot(x_values, cdf_values, label='CDF', color='red')
plt.title('Exponential Distribution - CDF')
plt.legend()

plt.tight_layout()
plt.show()

Following is the output of the Exponential distribution calculated using scipy.stats.expon.pdf() and scipy.stats.expon.cdf() function −

Gamma Distribution

The Gamma Distribution is a generalization of the exponential distribution that includes an additional parameter, the shape parameter which allows for a wider variety of distribution shapes. This distribution is frequently used in queuing theory and reliability analysis.

In SciPy the gamma distribution is represented by the scipy.stats.gamma object. Below is an example of calculating the PDF and CDF for the gamma distribution −

from scipy.stats import gamma
import numpy as np
import matplotlib.pyplot as plt

# Parameters for the gamma distribution
shape_param = 2
scale_param = 1

# Generate an array of x values
x_values = np.linspace(0, 10, 100)

# Compute the PDF and CDF
pdf_values = gamma.pdf(x_values, shape_param, scale=scale_param)
cdf_values = gamma.cdf(x_values, shape_param, scale=scale_param)

# Plot the results
plt.figure(figsize=(12, 6))

# PDF plot
plt.subplot(1, 2, 1)
plt.plot(x_values, pdf_values, label='PDF')
plt.title('Gamma Distribution - PDF')
plt.legend()

# CDF plot
plt.subplot(1, 2, 2)
plt.plot(x_values, cdf_values, label='CDF', color='red')
plt.title('Gamma Distribution - CDF')
plt.legend()

plt.tight_layout()
plt.show()

Below is the output of the Gamma distribution calculated using scipy.stats.gamma.pdf() and scipy.stats.gamma.cdf() function −

Beta Distribution

The Beta Distribution is a versatile distribution used to model random variables that are constrained to a fixed interval, typically between 0 and 1. It is often applied in scenarios where probabilities and proportions are involved such as in Bayesian statistics.

The beta distribution is represented in SciPy by scipy.stats.beta. Here's an example of plotting the PDF and CDF of a beta distribution −

from scipy.stats import beta
import numpy as np
import matplotlib.pyplot as plt

# Set the shape parameters for the beta distribution
alpha = 2
beta_param = 5

# Generate values for x in the range [0, 1]
x_values = np.linspace(0, 1, 100)

# Calculate the PDF and CDF
pdf_values = beta.pdf(x_values, alpha, beta_param)
cdf_values = beta.cdf(x_values, alpha, beta_param)

# Plot the results
plt.figure(figsize=(12, 6))

# PDF plot
plt.subplot(1, 2, 1)
plt.plot(x_values, pdf_values, label='PDF')
plt.title('Beta Distribution - PDF')
plt.legend()

# CDF plot
plt.subplot(1, 2, 2)
plt.plot(x_values, cdf_values, label='CDF', color='red')
plt.title('Beta Distribution - CDF')
plt.legend()

plt.tight_layout()
plt.show()

Below is the output of the Beta distribution calculated using scipy.stats.beta.pdf() and scipy.stats.beta.cdf() function −

Working with Continuous Distributions in SciPy

SciPy provides numerous methods for manipulating and working with continuous distributions which are mentioned as below −

PDF (Probability Density Function): distribution.pdf(x, params) computes the likelihood of a given value x.
CDF (Cumulative Distribution Function): distribution.cdf(x, params) calculates the cumulative probability up to the point x.
PPF (Percent-Point Function): distribution.ppf(p, params) returns the value corresponding to a specified cumulative probability p.
Random Sampling: distribution.rvs(params, size=N) generates N random values from the distribution.
Mean and Variance: distribution.mean() and distribution.var() calculate the mean and variance of the distribution.

For instance we can calculate the mean and variance of a normal distribution as follows −

from scipy.stats import norm

# Calculate the mean and variance of the normal distribution
mean = norm.mean(loc=0, scale=1)
variance = norm.var(loc=0, scale=1)

print("Mean of Normal Distribution:", mean)
print("Variance of Normal Distribution:", variance)

Here is the output of Mean and Variance of a normal distribution −

Mean of Normal Distribution: 0.0
Variance of Normal Distribution: 1.0

SciPys scipy.stats module offers a powerful suite of tools for working with continuous probability distributions. Whether we're analyzing simple distributions like the normal and exponential distributions or more complex models like the beta and gamma distributions, SciPy provides the necessary functions to calculate key statistical measures and perform in-depth analysis of continuous data.

Print Page