uthmandevsec

uthmandevsec

Popular repositories Loading

Self-Distillation Self-Distillation Public

🤖 Enable continual learning by reproducing the On-Policy Self-Distillation algorithm for robust and efficient fine-tuning with TRL-based code.

Python 2 1
ebook ebook Public

Python
ProjetYoutube ProjetYoutube Public

Python
uthmandevsec.github.io uthmandevsec.github.io Public

🤖 Reproduce the On-Policy Self-Distillation algorithm to enhance continual learning in models, minimizing forgetting while learning from demonstrations.