Skip to content
/ FEPS Public

FEPS (Feature Extraction from Protein Sequences) is a web-based bioinformatics tool for computing 2765 sequence-derived protein features across 7 feature groups. It supports machine learning and deep learning applications such as protein function prediction, classification, and localization.

License

Notifications You must be signed in to change notification settings

hamiddi/FEPS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧬 FEPS: Feature Extraction from Protein Sequences

FEPS is a comprehensive web-based bioinformatics tool designed to extract the most widely used sequence-derived features from protein sequences.

Pioneering the field of automated protein feature extraction, FEPS was first released in 2016 and has since evolved into a powerful platform supporting both traditional machine learning and modern deep learning approaches.

DOI


πŸ” Key Features

  • 48 distinct feature extraction methods
  • Organized into 7 major feature groups
  • Computes a total of 2765 descriptors
  • Supports multiple input groups and class labels
  • Direct integration with ML/DL pipelines (SVM, RF, KNN, CNN, etc.)
  • Designed for scalability, reproducibility, and usability

πŸ“– Ismail et al., 2022 β€” FEPS: A Tool for Feature Extraction from Protein Sequence


🎯 Applications

FEPS-generated features can be used for a wide range of classification and prediction tasks in computational biology, such as:

  • 🧠 Protein function prediction
  • 🧬 Protein classification
  • 🧱 Protein structure prediction
  • πŸ“ Subcellular localization prediction
  • βž• And many more!

🌐 Access the Webserver

πŸ‘‰ Launch the FEPS tool: https://hdismail.com/feps/


πŸ“₯ Input Format

FEPS accepts fasta-formatted protein sequence files. The recommended format for classification scenarios:

  • βœ… Sequences must be valid protein sequences
  • βœ… Files must be in FASTA format
  • βœ… Sequences of the same class/group should be combined into one file
  • βœ… The filename will be used to label the group

πŸ“¦ Download example dataset

πŸ’‘ Replace your-username/your-repo-name with the actual GitHub repo path.


🧾 Feature Types

The extracted features are organized into 7 major groups:

Group Feature Category
1 Amino Acid Composition
2 Composition, Transition, and Distribution
3 Autocorrelation Descriptors
4 Pseudo Amino Acid Composition
5 Quasi & Sequence-Order-Coupling
6 Shannon Entropy Descriptors
7 Conjoint Triad

Users can interactively select any group and corresponding methods using the web interface.


πŸš€ Getting Started

πŸ‘‰ Visit the FEPS Webserver: https://hdismail.com/feps/
πŸ‘‰ Read the full article: Methods in Molecular Biology, 2022
πŸ‘‰ Download example dataset


πŸ“„ Citation

If you use FEPS in your research, please cite:

Ismail H, White C, Al-Barakati H, Newman RH, Kc DB. (2022). FEPS: A Tool for Feature Extraction from Protein Sequence. Methods Mol Biol. 2499:65–104. https://doi.org/10.1007/978-1-0716-2317-6_3


πŸ§‘β€πŸ’» Authors

  • Dr. Hamid D. Ismail
  • Collaborators: White C, Al-Barakati H, Newman RH, Kc DB

πŸ“¬ Contact

For questions or feature requests, please contact:
πŸ“§ hamid@hamiddi.com


πŸ“˜ License

This project is licensed under the MIT License.

About

FEPS (Feature Extraction from Protein Sequences) is a web-based bioinformatics tool for computing 2765 sequence-derived protein features across 7 feature groups. It supports machine learning and deep learning applications such as protein function prediction, classification, and localization.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published