GitHub - saziaa/Decoding_Energy_Consumption: Machine learning project predicting energy consumption in East Melbourne WWTP using ensemble and deep learning models, with SHAP-based feature importance analysis.

Decoding Energy Consumption: ML Predictions & Model Comparison in East Melbourne WWTP

📌 Project Overview

Wastewater treatment plants (WWTPs) are essential for environmental sustainability, but their energy-intensive operations pose cost and carbon emission challenges. This project focuses on the East Melbourne Wastewater Treatment Plant, aiming to predict energy consumption (EC) based on organic, hydraulic, and climatic parameters and compare the performance of various machine learning models.

🔹 Objectives

Identify key factors influencing energy consumption in WWTP operations.

Develop and compare machine learning and deep learning models for EC prediction.

Evaluate model robustness, efficiency, and predictive accuracy using historical data.

Provide actionable insights to optimize energy usage and enhance sustainability.

🔹 Data

Data Source: East Melbourne WWTP Dataset - Mendeley Data

Years Covered: 2014–2019

Features: Organic parameters, hydraulic flow, climatic factors, and energy consumption metrics

Target Variable: Daily Electrical Conductivity (EC)

🔹 Methodology

Feature Selection:

SHAP values used to assess feature importance.

Influential features identified: Month, Total Nitrogen, COD, Average Inflow, Average Temperature.

Models Evaluated:

Traditional: Ridge Regression, Support Vector Regression (SVR)

Ensemble Methods: Random Forest (RF), Gradient Boosting (GB), XGBoost

Deep Learning: Artificial Neural Network (ANN), Convolutional Neural Network (CNN), Long Short-Term Memory Network (LSTM)

Validation & Evaluation:

Time series cross-validation

Metrics: RMSE, MAE, MAPE, Median Absolute Deviation (MAD)

Residual and confidence interval analyses

🔹 Key Findings

Random Forest is the most effective model for predicting EC, followed by Gradient Boosting, Ridge Regression, and LSTM.

Predictions are unbiased and robust across the test dataset.

Important predictive features: Month, Total Nitrogen, COD, Average Inflow, Average Temperature.

Ensemble methods outperform traditional and deep learning approaches in this dataset.

🔹 Tools & Libraries

Language: Python 3.9

Platform: Google Colab

Libraries: Pandas, NumPy, Scikit-learn, Keras, TensorFlow, Scikeras, Matplotlib, Seaborn

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
CIND_860_code_and_result.ipynb		CIND_860_code_and_result.ipynb
CIND_860_report_v0.1.pdf		CIND_860_report_v0.1.pdf
Dataset_Melbourne_Wastewater_Treatment.csv		Dataset_Melbourne_Wastewater_Treatment.csv
README.md		README.md
profile_report_EC.html		profile_report_EC.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Decoding Energy Consumption: ML Predictions & Model Comparison in East Melbourne WWTP

About

Uh oh!

Releases

Packages

Languages

saziaa/Decoding_Energy_Consumption

Folders and files

Latest commit

History

Repository files navigation

Decoding Energy Consumption: ML Predictions & Model Comparison in East Melbourne WWTP

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages