Probability & Statistics

Beta Distribution

Explore the Beta distribution and the effect of its shape parameters.

Beta Distribution

Concept Overview

The Beta distribution is a family of continuous probability distributions defined on the interval [0, 1] parameterized by two positive shape parameters, denoted by α (alpha) and β (beta). It is highly versatile and is often used to model random variables whose values are limited to intervals of finite length, such as proportions, probabilities, or percentages. In Bayesian inference, it serves as the conjugate prior probability distribution for the Bernoulli, binomial, negative binomial, and geometric distributions.

Mathematical Definition

The probability density function (PDF) of the Beta distribution, for 0 ≤ x ≤ 1, and shape parameters α, β > 0, is a power function of the variable x and of its reflection (1 − x):

f(x; α, β) = xα-1(1-x)β-1 / B(α, β)
Where the Beta function, B(α, β), acts as a normalizing constant to ensure the total probability integrates to 1:
B(α, β) = ∫01 tα-1(1-t)β-1 dt
The Beta function can also be expressed in terms of the Gamma function (Γ):
B(α, β) = Γ(α)Γ(β) / Γ(α+β)
Key summary statistics:
E[X] = α / (α + β)   (expected value / mean)
Mode = (α - 1) / (α + β - 2)   (for α, β > 1)
Var(X) = αβ / ((α + β)2(α + β + 1))   (variance)

Key Concepts

Shape Parameters

The parameters α and β determine the shape of the distribution. If α = β, the distribution is symmetric. If α > β, the distribution is skewed towards 1 (left-skewed). If α < β, it is skewed towards 0 (right-skewed). When α = β = 1, the Beta distribution reduces to the Uniform distribution on [0, 1]. For α < 1 and β < 1, the distribution is U-shaped. For α > 1 and β > 1, it is unimodal.

Bayesian Updating

In Bayesian statistics, the Beta distribution is commonly used as a prior distribution for binomial proportions. If our prior distribution of a probability p is Beta(α, β), and we observe s "successes" and f "failures" in a binomial experiment, the posterior distribution is remarkably simple: Beta(α + s, β + f). This makes it incredibly useful for sequentially updating beliefs as new data arrives.

Relationship with Gamma Distribution

The Beta distribution is closely related to the Gamma distribution. If X ~ Gamma(α, θ) and Y ~ Gamma(β, θ) are independent random variables, then the ratio X / (X + Y) follows a Beta(α, β) distribution. This relationship provides a practical method for generating Beta-distributed random numbers.

Historical Context

The mathematical foundations of the Beta distribution lie in the Beta function, which was studied by Leonhard Euler and Adrien-Marie Legendre in the 18th and 19th centuries as they expanded the calculus of factorials (the Gamma function) to non-integers.

Its formal use in probability theory was pioneered by Karl Pearson in 1895 as "Type I" in his system of continuous probability distributions. Pearson developed this system to model a wide variety of skewed data observed in biology and other empirical sciences that the normal distribution could not adequately describe.

Real-world Applications

  • A/B Testing and Conversion Rates: Used extensively in Bayesian A/B testing to model the true conversion rate of different variants based on observed successes and failures.
  • Project Management (PERT): In the Program Evaluation and Review Technique (PERT), task durations are modeled using a transformed Beta distribution to estimate the expected completion time given optimistic, most likely, and pessimistic estimates.
  • Reliability Engineering: Used to model the probability of failure of physical systems or components, often acting as the distribution for the parameter of a binomial failure process.
  • Population Genetics: Models the allele frequencies in a population under the effects of mutation and genetic drift, such as in Wright's distribution.
  • Sports Analytics: Used to estimate a player's true batting average or shooting percentage based on limited early-season observations.

Related Concepts

  • Probability Distributions — the Beta distribution is a key continuous distribution.
  • Bayes' Theorem — the foundational principle for using the Beta distribution as a conjugate prior.
  • Hypothesis Testing — Bayesian equivalents to traditional hypothesis tests often rely on Beta posteriors.
  • Monte Carlo Simulation — generating samples from Beta distributions is a common step in complex simulations.

Experience it interactively

Adjust parameters, observe in real time, and build deep intuition with Riano’s interactive Beta Distribution module.

Try Beta Distribution on Riano →

More in Probability & Statistics