Project Groot is a from-scratch implementation of a Transformer-based language model in PyTorch, designed to explore the space of Large Language Models (LLMs) through careful architectural choices, training stability experiments and new optimization techniques.
Note
Just like the Marvel character Groot appears in different forms and sizes, this project is designed to scale from Groot Tiny → Groot Small → Groot Medium → Groot Large, while keeping the core architecture and principles consistent.
The current models are GPT-2-style, decoder-only transformers. They have been pretrained on the TinyStories dataset and will subsequently be finetuned for general Question-Answering.
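For reference, the sketch below shows what a GPT-2-style decoder-only block typically looks like in PyTorch: causal multi-head self-attention followed by an MLP, each wrapped in a pre-norm residual connection. This is an illustrative sketch, not the project's actual source; the module names, the pre-norm layout, and the use of `F.scaled_dot_product_attention` are assumptions on my part.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class CausalSelfAttention(nn.Module):
    """Multi-head self-attention with a causal mask (GPT-2 style). Illustrative only."""

    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)   # fused query/key/value projection
        self.proj = nn.Linear(d_model, d_model)      # output projection

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, T, C = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Reshape to (B, n_heads, T, head_dim) for attention
        q, k, v = (t.view(B, T, self.n_heads, C // self.n_heads).transpose(1, 2) for t in (q, k, v))
        # is_causal=True applies the upper-triangular mask needed for autoregressive decoding
        y = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        y = y.transpose(1, 2).contiguous().view(B, T, C)
        return self.proj(y)


class DecoderBlock(nn.Module):
    """Pre-norm transformer block: attention and MLP, each with a residual connection."""

    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = CausalSelfAttention(d_model, n_heads)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x + self.attn(self.ln1(x))
        x = x + self.mlp(self.ln2(x))
        return x
```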
- GrootTiny: As the name suggests, this is the smallest Groot LM, with just ~120M parameters (see the parameter-count sketch after this list).
  - Pretraining on the TinyStories dataset
  - Finetuning for general Question-Answering
  - Training on WikiText
- GrootSmall: This is the follow-up model to GrootTiny, with ~250M parameters.
  - Pretraining on the TinyStories dataset
  - Finetuning for general Question-Answering
  - Training on WikiText
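Parameter counts like the ones above can be estimated for any GPT-2-style configuration with a quick back-of-the-envelope count. The sketch below is purely illustrative: the hidden size, layer count, vocabulary size, and context length are assumed, GPT-2-small-like values chosen to land near the ~120M figure, not Groot's actual hyperparameters.

```python
def gpt2_param_count(vocab_size: int, d_model: int, n_layers: int, max_seq_len: int) -> int:
    """Rough parameter count for a GPT-2-style decoder-only model
    (tied input/output embeddings, learned positional embeddings)."""
    embeddings = vocab_size * d_model + max_seq_len * d_model
    # Per block: QKV + output projection (4 * d_model^2) plus a 4x MLP (8 * d_model^2),
    # ignoring the comparatively small bias and LayerNorm terms.
    per_block = 12 * d_model ** 2
    return embeddings + n_layers * per_block


# Assumed, GPT-2-small-like config (not Groot's actual hyperparameters):
print(gpt2_param_count(vocab_size=50257, d_model=768, n_layers=12, max_seq_len=1024))
# ~124M, in the same ballpark as GrootTiny's ~120M
```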