Skip to content

kenoharada/language-model-from-scratch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

language-model-from-scratch

Learn how to develop language model by developing tiny language models.

Contents

  • Playing with language models by prompting
  • Data preparation
    • Tinystories taste synthetic data
    • wikipedia deta
  • Tokenizer
  • Ngram language model
  • Attention
  • Pretraining
  • Instruction finetuning
  • RLHF (RLAIF)

Setup

# install pytorch following https://pytorch.org/
# other libraries
pip3 install -r requirements.txt

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published