In the world of statistics, we don't just care about the distribution of individual people—we care about the distribution of samples. If you take a sample of 10 people and calculate their average height, that "average" () is itself a random variable. The distribution of such a statistic is called a Sampling Distribution.
Imagine taking a standard normal variable and squaring it. Since can never be negative, the distribution is "pushed" to the right. If you sum of these squared variables, you get a Chi-Squared distribution with degrees of freedom.
Why do we care about Chi-Squared? Because Sample Variance () follows a Chi-Squared distribution (specifically, ). If you want to test whether a new manufacturing process has reduced the "spread" of defects, you are performing a Chi-Squared test.
Chi-Squared Distributions
As the degrees of freedom (k) increase, the distribution moves to the right and becomes more 'Normal' (thanks to the Central Limit Theorem!).
The t-distribution was invented by William Gosset, a chemist at the Guinness Brewery in 1908. He needed a way to monitor the quality of stout using only very small samples. Since Guinness forbade its employees from publishing secrets, he wrote under the pseudonym "Student."
When we don't know the true population variance (which is almost always the case), we have to estimate it from our data. This extra layer of estimation adds "noise." The t-distribution looks like a Normal curve but with Fat Tails—it effectively says, "With a small sample, extreme outcomes are more likely than the Normal distribution would predict."
Standardization with Estimated Variance
If , but is unknown, then:
Standard Normal vs. t-Distributions
Notice how the t-distribution (green) has much more area in the tails than the Normal (blue). This is the 'uncertainty penalty' for having a small sample size.
What is the probability that the fastest person in a race breaks a record? Or that the median house price in a city stays below $500k? These are questions about Order Statistics.
If we sort a sample from smallest to largest: , then is the -th order statistic.
Order statistics are used to predict the "100-year flood." This is the (the sample maximum) of annual rainfall over a century. By studying the distribution of the maximum, engineers can build bridges strong enough to survive rare, extreme storms.
Understanding the Median
- 1SortArrange the data from smallest to largest.
- 2PickSelect the value at position (n+1)/2 (the median).
- 3InterpretThe median is the (n+1)/2-th Order Statistic. Unlike the mean, it is robust to outliers—one billionaire in a room of poor people won't change the median income much, but they will skyrocket the mean.