Machine-Learning-Package

This is an open-source machine learning repository in Python, developed by research students at Colgate University. The package employs some of the most popular feature selection and classification methods to build predictive and analytical models for any dataset.

The package imputes missing data, performs a grid search to find the best predictive model, and oversamples data to correct class imbalance in the dataset. This makes it easy for people with no coding experience to apply it in their own field of research.

How to use

The package is run from main.py, where you enter your file name, specify the dependent variable, and configure the predictive models. In addition, every function in the package is documented in detail for easy navigation.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

Besides the Anaconda distribution, the following libraries are needed:

pandas
os
numpy
imblearn
scikit-learn
pyreadstat
statistics
math
statsmodels.api
joblib
xlsxwriter
openpyxl
glob
seaborn
matplotlib
ReliefF
xgboost

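The os, statistics, math, and glob modules are part of the Python standard library and need no installation. You can install the other libraries by running "conda install -c conda-forge" or "pip install -U" followed by the name of the library on the command line on Windows/macOS, or "conda install -c conda-forge" or "pip3 install -U" on Linux. Note that statsmodels.api is installed as statsmodels, and imblearn as imbalanced-learn.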

Please refer to the Anaconda docs (https://docs.anaconda.com/anaconda/install/) on how to install the Anaconda package.
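Before running main.py, it may help to check which of the libraries listed above are already available. The sketch below uses only the standard library. The import names are assumptions where they differ from the list: scikit-learn is imported as sklearn, statsmodels.api belongs to statsmodels, and ReliefF is assumed to be importable under that exact name. The helper find_missing is a hypothetical name, not part of this package.

```python
import importlib.util

# Top-level import names for the third-party libraries in the list above.
# Standard-library modules (os, statistics, math, glob) are left out.
REQUIRED_MODULES = [
    "pandas", "numpy", "imblearn", "sklearn", "pyreadstat", "statsmodels",
    "joblib", "xlsxwriter", "openpyxl", "seaborn", "matplotlib",
    "ReliefF",  # assumed import name
    "xgboost",
]


def find_missing(modules):
    """Return the module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


missing = find_missing(REQUIRED_MODULES)
if missing:
    print("Missing libraries:", ", ".join(missing))
else:
    print("All required libraries are installed.")
```

Any name reported as missing can then be installed with the conda or pip commands described above.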

Contact

Ziad Attia: zattia@colgate.edu

