Course — Statistics Foundations: The Basics

Aakash Goel
3 min readFeb 21, 2022

Objective of writing: Preparing Notes or my learnings from course — Statistics Foundations: The Basics. Don’t have any intention to spam.

Mean, Median and Mode

Mean → Average

Mean of all 3 above dataset is same but points very far apart from each other in all 3. Hence, variability (range and standard deviation) in data is very important to measure.

Median → Middle element of data

Mode → Frequency of data (Most common value in dataset). Dataset may have no mode, single mode or multiple mode value

Message →Two students performing significantly poor, lowers down the average. If these two student have to complaint on exam, professor may ask to refer median.

Z-Score

If we have to tell, how far is `231` from Mean, one way of telling is 231 is 92 far from mean. We can use Z-Score to tell how far is data point from mean and in standard deviation

It means that data point `231` is 2.24 standard deviation from the mean in positive direction.

If z-score is -ve, it tells that point is less than mean.

Standard Deviation

Avg. square distance from mean

One dataset having Standard deviation of 20 and other dataset having standard deviation of 200. Which data have less variation ?

Ans. is it really depends, Like if 20 degree Celsius standard deviation in temperature in Paris in summer, its really high no.

Std. deviation can be used to study each data point as well. Consider below example, having 2 person’s salary ..

Person1 is Mean + 2 std. dev

Person2 is Mean -1 std. dev

Empirical Rule

Assuming, data is normally distributed, Most of data points in a given data set fall within 3 std. deviation of the mean.

Outlier

Data point that is an abnormal distance from the other values in the data set.

That’s all we have in this course. Thank you for reading.

Please clap if it helped you !!

https://www.linkedin.com/learning/certificates/fadb44644141c2e68b23deda734b0dfffcc44c5d22f8f9fad3cb77e084149e30?trk=share_certificate

END

--

--