
🚀 Nano-GPT: Transformer-Based Language Model

An interactive Transformer language model trained on the Tiny Shakespeare dataset. 📜

Inspired by Andrej Karpathy, this project helped me understand the core concepts behind Transformers, like self-attention and sequence generation. I'm now working on scaling this up to create a GPT-2 clone! 🚀


📘 What I Built

  • Self-Attention Mechanism: Helps the model focus on relevant tokens in the sequence (see the sketch after this list).
  • Multi-Head Attention: Runs several attention heads in parallel to capture diverse relationships between tokens.
  • Positional Embeddings: Give the model a sense of token order, which is crucial for sequences.
  • Layer Normalization: Stabilizes and accelerates training for better convergence.
  • Feedforward Neural Network: Processes attention outputs to extract complex features.
  • Token and Positional Embeddings: Convert characters into meaningful vector representations.
  • Greedy & Multinomial Sampling: Enables flexible and creative text generation (see the generation sketch further down).
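
To make the core idea concrete, here is a minimal sketch of a single causal self-attention head in PyTorch, in the nanoGPT style this project follows. The class name and hyperparameters are illustrative, not the repo's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Head(nn.Module):
    """One head of causal self-attention (illustrative sketch, not the repo's code)."""

    def __init__(self, n_embd, head_size, block_size, dropout=0.1):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        # Causal mask: each position may attend only to itself and earlier tokens.
        self.register_buffer("tril", torch.tril(torch.ones(block_size, block_size)))
        self.dropout = nn.Dropout(dropout)

    def forward(self, x):
        B, T, C = x.shape
        k = self.key(x)    # (B, T, head_size)
        q = self.query(x)  # (B, T, head_size)
        # Scaled dot-product attention scores.
        wei = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5  # (B, T, T)
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float("-inf"))
        wei = F.softmax(wei, dim=-1)
        wei = self.dropout(wei)
        v = self.value(x)  # (B, T, head_size)
        return wei @ v     # (B, T, head_size)
```

Multi-head attention simply runs several such heads in parallel and concatenates their outputs along the channel dimension before a final linear projection.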

๐Ÿ› ๏ธ Model Details

  • 🧠 Parameters: ~3 million
  • 📚 Dataset: Tiny Shakespeare (~1 MB of Shakespeare's works)
  • ⚡ Training Split: 90% training, 10% validation
  • 📊 Loss Estimation: Loss is averaged over many random batches from both the train and validation sets (see the sketch after this list)
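
A hedged sketch of that loss estimation, in the usual nanoGPT style: averaging the loss over a fixed number of random batches per split. The helper name, `get_batch` sampler, and the assumption that the model returns `(logits, loss)` are illustrative, not taken from the repo:

```python
import torch

@torch.no_grad()
def estimate_loss(model, get_batch, eval_iters=200):
    """Average loss over eval_iters random batches for each split (illustrative helper)."""
    model.eval()
    out = {}
    for split in ("train", "val"):
        losses = torch.zeros(eval_iters)
        for i in range(eval_iters):
            xb, yb = get_batch(split)  # assumed sampler returning (inputs, targets)
            _, loss = model(xb, yb)    # assumed: model returns (logits, loss)
            losses[i] = loss.item()
        out[split] = losses.mean().item()
    model.train()
    return out
```

Averaging over many batches smooths out the noise of any single batch, giving a more reliable picture of convergence on both splits.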

After training, the model can generate Shakespeare-like text; it's incredible to see how it captures the style and rhythm of classic literature! ✨
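
Here is a sketch of the generation loop, showing both sampling modes mentioned above. It assumes the same illustrative model interface as the loss sketch (the model returns `(logits, loss)`); none of these names are the repo's actual code:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def generate(model, idx, max_new_tokens, block_size, greedy=False):
    """Autoregressive generation (illustrative; idx is a (B, T) tensor of token ids)."""
    for _ in range(max_new_tokens):
        idx_cond = idx[:, -block_size:]   # crop to the model's context window
        logits, _ = model(idx_cond)       # assumed: model returns (logits, loss)
        logits = logits[:, -1, :]         # logits for the last position only
        if greedy:
            # Greedy sampling: always pick the single most likely next token.
            idx_next = logits.argmax(dim=-1, keepdim=True)
        else:
            # Multinomial sampling: draw from the distribution for more varied text.
            probs = F.softmax(logits, dim=-1)
            idx_next = torch.multinomial(probs, num_samples=1)
        idx = torch.cat((idx, idx_next), dim=1)
    return idx
```

Greedy decoding is deterministic and tends to loop on a small model, while multinomial sampling trades a little coherence for much more varied, Shakespeare-flavored output.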


🚀 Try It Out!


💡 What's Next?

I'm working on scaling this up into a GPT-2 clone, and I'm exploring optimizations like rotary embeddings and flash attention. If you have ideas or suggestions, I'd love to hear them! Let's keep learning and building together. 🚀
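
On the flash attention side, one low-effort path (an option, not necessarily what this repo will do): PyTorch 2.x exposes fused attention kernels through torch.nn.functional.scaled_dot_product_attention, which can replace the manual mask-softmax-matmul math in the head sketch above. The tensor shapes below are placeholders:

```python
import torch
import torch.nn.functional as F

# q, k, v: (batch, n_heads, seq_len, head_size). With is_causal=True, PyTorch
# applies the causal mask internally and dispatches to a fused (flash-style)
# kernel when the hardware and dtypes allow it.
q = torch.randn(4, 8, 256, 64)
k = torch.randn(4, 8, 256, 64)
v = torch.randn(4, 8, 256, 64)
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)  # (4, 8, 256, 64)
```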


๐Ÿค Acknowledgments

A huge shoutout to Andrej Karpathy for making complex concepts like self-attention, multi-head attention, and layer normalization so understandable; your teachings inspired this project! 🙌

[Demo picture]
