This repository contains a collection of Python Projects for Data Science, Analysis, Preprocessing, and Machine Learning. It covers essential Python libraries, Data Manipulation techniques, Scaling, Regression, Classification, and Practical Applications on Real Datasets.
-
Basic Libraries
Fundamental Python libraries:NumPy– Numerical ComputingMatplotlib– Basic PlottingSeaborn– Advanced Visualizations
-
Pandas Project
- Analysis on
car_data.csvusing Pandas. - Demonstrates 30 commonly used Pandas functions for Data Manipulation.
- Analysis on
-
Scaling
- Explains Standardisation and Normalisation.
- Mathematical walkthrough and Python implementation.
-
Regression Project
- Applies Linear and Logistic Regression on
lifestyle.csv. - Focus on model training, prediction, and evaluation metrics.
- Applies Linear and Logistic Regression on
-
Iris Analysis
- Quick analysis of the built-in Iris Dataset.
- Logistic Regression (Classification).
-
Decision Tree & Random Forest Project
- Applied Decision Tree and Random Forest Classifiers on
camera_dataset.csv. - Evaluation using Accuracy and Feature Importance.
- Applied Decision Tree and Random Forest Classifiers on
-
KNN Project
- Explains basics of K-Nearest Neighbors (KNN) for Regression and Classification.
- Applied on
_bills.csvDataset to classify fake or real bills.
-
E-Challan Cameras Project
- In-depth Analysis on
cameras.csvto identify the best cameras for a Traffic E-Challan System. - Preprocessing & Feature Engineering
- Rule-based Suitability Score using MinMax Scaling
-
Regression: Predict the rule-based score using
LinearRegressionandRandomForestRegressor. -
Classification: Convert score to classes (
High,Medium,Low) and predict usingLogisticRegressionandRandomForestClassifier.
- Regression achieved R² > 0.98 and Classification achieved 97% Accuracy.
- Key features influencing suitability:
Low and Effective Resolution. - Combines rule-based and ML methods for transparent, data-driven decisions.
- In-depth Analysis on
- Python 3.10+
- Libraries:
numpy,pandas,matplotlib,seaborn,scikit-learn
You can install dependencies via:
pip install -r requirements.txt