--- redirect_from: - "/chapters/14/why-the-mean-matters" interact_link: content/chapters/14/Why_the_Mean_Matters.ipynb kernel_name: python3 has_widgets: false title: |- Why the Mean Matters prev_page: url: /chapters/13/4/Using_Confidence_Intervals.html title: |- Using Confidence Intervals next_page: url: /chapters/14/1/Properties_of_the_Mean.html title: |- Properties of the Mean comment: "***PROGRAMMATICALLY GENERATED, DO NOT EDIT. SEE ORIGINAL FILES IN /content***" ---
In this course we have studied several different statistics, including total variation distance, the maximum, the median, and also the mean. Under clear assumptions about randomness, we have drawn empirical distributions of all of these statistics. Some, like the maximum and the total variation distance, have distributions that are clearly skewed in one direction or the other. But the empirical distribution of the sample mean has almost always turned out close to bell-shaped, regardless of the population being studied.
If a property of random samples is true regardless of the population, it becomes a powerful tool for inference because we rarely know much about the data in the entire population. The distribution of the mean of a large random sample falls into this category of properties. That is why random sample means are extensively used in data science.
In this chapter, we will study means and what we can say about them with only minimal assumptions about the underlying populations. Question that we will address include: