Continuous Probability Distributions
Probability density functions, Normal, Uniform, Exponential, Gamma, Weibull, and Lognormal distributions.
Unlike discrete variables which are counted (e.g., number of cracks), continuous variables are measured (e.g., length, time, pressure, strength). Because a continuous variable can take on infinitely many values within a range, the probability of it taking any specific exact value is zero. Instead, we calculate the probability that the variable falls within a specified interval.
Probability Density Functions
The continuous analog to a probability mass function.
Probability Density Function (PDF),
A function describing the relative likelihood for a continuous random variable to take on a given value. The probability that lies between and is the area under the curve from to .
It must satisfy two conditions:
- for all .
- (the total area under the curve is 1).
The Normal Distribution
The most important continuous distribution in statistics.
The Normal (Gaussian) distribution is pervasive because many natural phenomena and measurement errors follow a bell-shaped curve.
Normal Distribution
A continuous, symmetric, bell-shaped distribution completely defined by its mean () and standard deviation ().
- It is centered at , which is also its median and mode.
- The spread is determined by ; a larger results in a flatter, wider curve.
The Standard Normal Distribution
Converting any normal distribution to a standard scale.
To calculate probabilities for any normal distribution, we standardize the variable into a -score, which represents the number of standard deviations is from the mean.
Z-Score
A dimensionless quantity used to standardly map any normal distribution to a Standard Normal Distribution (, ).
Once is found, probabilities are looked up in a Standard Normal Table (Z-table) or calculated via software.
Other Common Continuous Distributions
Models used for specific engineering scenarios, particularly in reliability and failure analysis.
The Uniform Distribution
Continuous Uniform Distribution
Used when all values within an interval are equally likely. The PDF forms a rectangle. (e.g., Rounding errors in digital measurements).
- Mean:
- Variance:
The Exponential Distribution
Exponential Distribution
Closely related to the Poisson distribution. While Poisson models the number of occurrences in a fixed interval, the Exponential distribution models the time between occurrences (e.g., time between structural failures, or lifespan of a lightbulb).
- Mean:
- Variance:
Memoryless Property: The probability of failure in the next instant does not depend on how long the component has already survived.
The Gamma, Weibull, and Lognormal Distributions
Critical distributions for reliability engineering and material strength.
Gamma Distribution
A generalization of the Exponential distribution. It models the time until (the shape parameter, often denoted ) consecutive events occur, rather than just the first event.
Weibull Distribution
Extensively used in reliability engineering to model the "time to failure" of materials and mechanical systems (e.g., fatigue life of asphalt pavements or steel bearings). Unlike the memoryless Exponential distribution, Weibull can model failure rates that increase over time (wear-out) or decrease over time (early infant mortality).
- (Shape parameter): If , the failure rate increases over time (wear-out phase).
- (Scale parameter or characteristic life): The time by which 63.2% of the population will have failed.
Lognormal Distribution
If a variable follows a Normal distribution, then follows a Lognormal distribution. It is widely used to model environmental data, such as stream flows, pollutant concentrations, and grain sizes in soils, because it is right-skewed and bounded at zero (variables cannot be negative).
Key Takeaways
- PDFs: For continuous variables, probability is the area under the PDF curve. The probability of an exact value is zero.
- Normal Distribution: The benchmark bell-shaped curve. Use -scores to standardize and find probabilities.
- Uniform: Constant probability over an interval.
- Exponential: Time between independent, random events. It is memoryless.
- Gamma: Time until events occur.
- Weibull: The standard for modeling material fatigue life and "time to failure" with changing failure rates.
- Lognormal: Ideal for highly skewed positive data, like pollutant concentrations or streamflow.