A collection of datasets sourced from platforms such as Kaggle, together with the steps used to clean them.
This repository includes both the raw data and the corresponding cleaned versions, in which outliers and extreme values have been removed.
It demonstrates data preprocessing and cleaning techniques across several types of datasets.
The repository contains:
- Raw datasets
- Cleaned datasets
- Notes on data cleaning steps
This repository showcases skills in data preparation, outlier removal, and basic preprocessing.
It is intended as a portfolio of data handling and cleaning practices.
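
As an illustration of the outlier removal mentioned above, here is a minimal sketch of one common approach, the interquartile range (IQR) rule. This README does not say which method was used for each dataset, so the rule itself, the function name `remove_iqr_outliers`, the multiplier `k=1.5`, and the sample values are all assumptions chosen for the example. It uses only the Python standard library.

```python
from statistics import quantiles


def remove_iqr_outliers(values, k=1.5):
    """Keep only the values inside [Q1 - k*IQR, Q3 + k*IQR].

    Hypothetical helper for illustration; not taken from this repository.
    """
    # quantiles() needs at least two points; nothing to filter otherwise.
    if len(values) < 2:
        return list(values)

    # Quartiles of the data (the "inclusive" method treats the data as the
    # full population rather than a sample).
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr

    # Preserve the original order while dropping points outside the fences.
    return [v for v in values if lower <= v <= upper]


# Made-up sample column with one obviously extreme value.
raw = [12, 14, 15, 15, 16, 18, 19, 250]
cleaned = remove_iqr_outliers(raw)
print(cleaned)  # [12, 14, 15, 15, 16, 18, 19]
```

A larger `k` keeps more of the tails and a smaller `k` removes more. Whether a flagged point is a genuine error or a valid extreme value depends on the dataset, so it is worth checking before dropping it.
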
Data sources:
- Kaggle
- Other open data platforms
This project is for educational and portfolio purposes.
Please check individual dataset licenses before reuse.