1. Definition
For a random variable $X$, the cumulative distribution function (CDF) is:
$F(x) = P(X \leq x)$
- It gives the probability that the variable takes a value less than or equal to $x$.
- Defined for all real numbers $x$.
- Range: $0 \leq F(x) \leq 1$.
2. Properties
- Non-decreasing: $F(x)$ never decreases as $x$ increases.
- Limits:
- $\lim_{x \to -\infty} F(x) = 0$
- $\lim_{x \to +\infty} F(x) = 1$
- Right-continuous: No sudden jumps except for discrete variables.
3. Relationship with PDF/PMF
- If $X$ is continuous with density $f(x)$: $F(x) = \int_{-\infty}^x f(t)\,dt$ (Derivative: $f(x) = F'(x)$).
- If $X$ is discrete with probability mass function $p(x)$): $F(x) = \sum_{t \leq x} p(t)$
4. Examples
Discrete Example
Let $X$ = dice roll {1,2,3,4,5,6}, each with $p=1/6$.
- $F(3) = P(X \leq 3) = 1/6 + 1/6 + 1/6 = 0.5$.
- $F(6) = 1.0$.
Continuous Example
If $X \sim N(0,1)$ (standard normal):
- $F(0) = 0.5$.
- $F(1.645) \approx 0.95$.
- Meaning: 95% of values lie below 1.645.
5. Applications
- Probabilities: To find chance of being within an interval: $P(a < X \leq b) = F(b) – F(a)$
- Percentiles / Quantiles:
- 25th percentile = smallest $x$ such that $F(x) = 0.25$.
- Median = 50th percentile = $F(x)=0.5$.
- Simulation: Generate samples by inverting the CDF.
- Forecasting: Prediction intervals are based on quantiles of the forecast CDF.
6. Visualization
- Discrete CDF → step function.
- Continuous CDF → smooth S-shaped (sigmoid-like) curve.
Example: Standard normal CDF is symmetric and smooth, crossing 0.5 at $x=0$.
Summary:
The Cumulative Distribution Function (CDF) $F(x) = P(X \leq x)$ gives the probability that a random variable is less than or equal to a value. It’s non-decreasing, ranges from 0 to 1, and is central for computing probabilities, quantiles, and prediction intervals.
