Maths Study for DS, ML, and NLP: A Weekly Plan
- Week 1: DESCRIPTIVE STATISTICS
- GOALS: Understand and apply the basic summary statistics used to describe datasets
- Measures of Central Tendency
- Mean
- Median
- Mode
- Measures of Dispersion
- Range
- Variance
- Standard Deviation
- Interquartile Range (IQR = 75th percentile - 25th percentile)
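As a quick reference, the measures listed above can be computed with Python's standard `statistics` module. This is a minimal sketch: the `ages` sample is made up for illustration, and the population versions of variance and standard deviation are an assumption (use `variance` and `stdev` for sample estimates).

```python
import statistics

# Hypothetical sample, chosen only for illustration.
ages = [22, 24, 25, 26, 26, 30]

# Measures of central tendency
mean = statistics.mean(ages)
median = statistics.median(ages)
mode = statistics.mode(ages)

# Measures of dispersion
data_range = max(ages) - min(ages)
variance = statistics.pvariance(ages)   # population variance
std_dev = statistics.pstdev(ages)       # population standard deviation

# quantiles(n=4) returns the three quartile cut points (Q1, Q2, Q3).
q1, _, q3 = statistics.quantiles(ages, n=4)
iqr = q3 - q1

print(mean, median, mode, data_range, variance, std_dev, iqr)
```

Quartile values depend on the interpolation convention, so other tools may report a slightly different IQR for the same data.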
-
Outliers are unusually high or low values in your dataset that differ significantly from most other values.
-
Example:
-
In this list of ages:
- [22, 24, 25, 26, 150] — the 150 is clearly an outlier.
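One common convention for flagging outliers, not stated in these notes, is the 1.5 × IQR rule: a value counts as an outlier if it lies more than 1.5 × IQR below the first quartile or above the third quartile. A minimal sketch on the ages above, assuming the "inclusive" quartile method of Python's `statistics.quantiles`:

```python
import statistics

ages = [22, 24, 25, 26, 150]

# Assumption: the "inclusive" quartile method. With only five values the
# choice of method changes the quartiles noticeably.
q1, _, q3 = statistics.quantiles(ages, n=4, method="inclusive")
iqr = q3 - q1

# 1.5 * IQR fences: a widely used rule of thumb, not a strict definition.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

outliers = [x for x in ages if x < lower_fence or x > upper_fence]
print(outliers)  # [150]
```

The threshold 1.5 is a convention; stricter or looser cut-offs are also used depending on the data.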
-
The mean (average) is sensitive to outliers: a single extreme value can pull it up or down significantly.
-
📌 Example:
-
Ages:
-
[25, 26, 27, 28, 29] → Mean = 27
-
[25, 26, 27, 28, 100] → Mean = 41.2 ← Big change due to one outlier
-
The median is the middle value, so it’s stable even with outliers.
-
📌 Example:
-
[25, 26, 27, 28, 100] → Median = 27
-
Same median as before!
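The two examples can be checked side by side with a short sketch using Python's standard `statistics` module (the variable names are illustrative):

```python
import statistics

ages = [25, 26, 27, 28, 29]
ages_with_outlier = [25, 26, 27, 28, 100]

mean_clean = statistics.mean(ages)
mean_outlier = statistics.mean(ages_with_outlier)
median_clean = statistics.median(ages)
median_outlier = statistics.median(ages_with_outlier)

print("mean:  ", mean_clean, "->", mean_outlier)      # 27 -> 41.2
print("median:", median_clean, "->", median_outlier)  # 27 -> 27
```

Replacing one value moves the mean by more than 14 years, while the median does not move at all.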
- The mode is just the most frequent value, so outliers don’t matter here.
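A small sketch of the same idea for the mode. The ages here are hypothetical, with one repeated value added so that a mode exists:

```python
import statistics

# Hypothetical data: 26 appears twice in both lists.
ages = [25, 26, 26, 27, 28]
ages_with_outlier = [25, 26, 26, 27, 100]

mode_clean = statistics.mode(ages)
mode_outlier = statistics.mode(ages_with_outlier)

print(mode_clean, mode_outlier)  # 26 26
```

An outlier would only change the mode if the extreme value itself became the most frequent one.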
-
Range, variance, and standard deviation measure the spread of the data.
-
Outliers inflate these measures, so the spread looks wider than it is for most of the values.
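To see the size of the effect, the sketch below compares the dispersion measures for the two age lists used earlier. The helper `spread` is a hypothetical name, and the population variance and the "inclusive" quartile method are assumptions made for illustration.

```python
import statistics

def spread(data):
    """Return common dispersion measures for a list of numbers."""
    # Assumption: "inclusive" quartiles, population variance and std dev.
    q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "range": max(data) - min(data),
        "variance": statistics.pvariance(data),
        "std_dev": statistics.pstdev(data),
        "iqr": q3 - q1,
    }

clean = spread([25, 26, 27, 28, 29])
with_outlier = spread([25, 26, 27, 28, 100])

for name in clean:
    print(f"{name:9s} {clean[name]:10.2f} -> {with_outlier[name]:10.2f}")
```

Under these assumptions the range grows from 4 to 75 and the variance from 2 to about 865, while the IQR stays at 2, because it depends only on the middle half of the data.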
