Central Tendency:
A measure of central tendency is a single value that attempts to describe a set of data by identifying the central position within that set of data. As such, measures of central tendency are sometimes called measures of central location.
The mean, median and mode are all valid measures of central tendency
mean() |
Arithmetic mean (“average”) of data. |
harmonic_mean() |
Harmonic mean of data. |
median() |
Median (middle value) of data. |
median_low() |
Low median of data. |
median_high() |
High median of data. |
median_grouped() |
Median, or 50th percentile, of grouped data. |
mode() |
Mode (most common value) of discrete data. |
Python Code:
Mean:
The arithmetic mean is the sum of the data divided by the number of data points. It is commonly called “the average”, although it is only one of many different mathematical averages. It is a measure of the central location of the data.
import statistics as st
grades = [100, 100, 90, 40, 80, 100, 85, 70, 90, 65, 90, 85, 50.5]
st.mean(grades)
81.82142857142857
Harmonic Mean:
The harmonic mean, sometimes called the subcontrary mean, is the reciprocal of the arithmetic mean()
of the reciprocals of the data. For example, the harmonic mean of three values a, b and c will be equivalent to 3/(1/a + 1/b + 1/c)
.
print(st.harmonic_mean(grades))
76.1552251422324
Median:
It return the median (middle value) of numeric data
print(st.median(grades))
87.5
Mode:
It return the most common data point from discrete or nominal data. The mode (when it exists) is the most typical value, and is a robust measure of central location.
print(st.mode(grades))
100
note:If data is empty, or if there is not exactly one most common value, StatisticsError
is raised.
Dispersion:
Dispersion in statistics is a way of describing how spread out a set of data is. When a data set has a large value, the values in the set are widely scattered; when it is small the items in the set are tightly clustered. Very basically, this set of data has a small value:
1, 2, 2, 3, 3, 4
…and this set has a wider one:
0, 1, 20, 30, 40, 100
In Python we can calculate the dispersion from following methods:
pstdev() |
Population standard deviation of data. |
pvariance() |
Population variance of data. |
stdev() |
Sample standard deviation of data. |
variance() |
Sample variance of data. |
Variance: Variance is a measurement of dispersion of the data from the mean value of the distribution. It tells how far the data points lie from the mean of the distribution.
Var(X)=E[(X-µ)2 ] for a population
Var(X)=E[(X-‾x)2 ] for a sample
Python:
print(st.pvariance(grades))
335.62882653061223
You may also obtain the recommendations from the experienced players, simply through the
web website.