I built a chatbot using transformer models trained on datasets like Cornell Movie Dialogs and Reddit chats.
The model learns conversational context through self-attention, which helps it produce human-like replies.
I fine-tuned it to give relevant, consistent, and tone-controlled responses, and it generates replies with decoding methods like beam search and sampling.
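To make the two decoding methods concrete, here is a minimal sketch of beam search and temperature/top-k sampling. It is not the project's actual decoder: `TOY_MODEL`, `next_token_probs`, `beam_search`, and `sample_reply` are hypothetical names, and the hand-written probability table stands in for a trained transformer.

```python
import math
import random

# Hypothetical stand-in for a trained model: maps the last token to
# next-token probabilities. A real transformer would condition on the
# whole sequence, not just the last token.
TOY_MODEL = {
    "<s>": {"hi": 0.6, "hello": 0.4},
    "hi": {"there": 0.7, "</s>": 0.3},
    "hello": {"there": 0.5, "</s>": 0.5},
    "there": {"</s>": 1.0},
}


def next_token_probs(tokens):
    """Return {token: probability} for the next position (toy lookup)."""
    return TOY_MODEL[tokens[-1]]


def beam_search(beam_width=2, max_len=5):
    """Keep the beam_width best partial replies by summed log-probability."""
    beams = [(0.0, ["<s>"])]
    finished = []
    for _ in range(max_len):
        candidates = []
        for score, tokens in beams:
            for tok, p in next_token_probs(tokens).items():
                cand = (score + math.log(p), tokens + [tok])
                # Sequences that emit the end token leave the beam.
                (finished if tok == "</s>" else candidates).append(cand)
        if not candidates:
            break
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:beam_width]
    pool = finished if finished else beams
    return max(pool, key=lambda c: c[0])[1]


def sample_reply(temperature=1.0, top_k=2, max_len=5, rng=None):
    """Sample one reply, restricted to the top_k tokens at each step."""
    rng = rng or random.Random()
    tokens = ["<s>"]
    while tokens[-1] != "</s>" and len(tokens) <= max_len:
        probs = next_token_probs(tokens)
        top = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
        # p ** (1 / T) is proportional to softmax(log p / T):
        # T < 1 sharpens the distribution, T > 1 flattens it.
        weights = [p ** (1.0 / temperature) for _, p in top]
        tokens.append(rng.choices([t for t, _ in top], weights=weights)[0])
    return tokens
```

Beam search returns the same highest-scoring reply on every call, while sampling trades some likelihood for variety. This sketch applies no length normalization, so the beam score tends to favor shorter replies.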
This project uses the Cornell Movie Dialogs Corpus, stored in the cornell_movie_dialogs folder.
The dataset includes:
- movie_lines.txt
- movie_conversations.txt
- movie_characters_metadata.txt
- movie_titles_metadata.txt
- raw_script_urls.txt
- chameleons.pdf
- pairs_encoded.json
- WORDMAP_corpus.json
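As an illustration of how the raw corpus files can be turned into training pairs, the sketch below parses movie_lines.txt and movie_conversations.txt, which separate fields with ` +++$+++ `, and pairs each utterance with the one that follows it. The function names are hypothetical, and this is not necessarily how pairs_encoded.json in this repo was produced.

```python
import ast
from pathlib import Path

SEP = " +++$+++ "  # field separator used by the Cornell corpus files


def parse_lines(rows):
    """Map line ID -> utterance text from movie_lines.txt rows."""
    lines = {}
    for row in rows:
        parts = row.rstrip("\n").split(SEP)
        # Expected fields: lineID, characterID, movieID, character name, text
        if len(parts) == 5:
            lines[parts[0]] = parts[4]
    return lines


def parse_conversations(rows):
    """Return a list of line-ID lists from movie_conversations.txt rows."""
    conversations = []
    for row in rows:
        parts = row.rstrip("\n").split(SEP)
        # Expected fields: characterID1, characterID2, movieID, "['L1', 'L2']"
        if len(parts) == 4:
            conversations.append(ast.literal_eval(parts[3]))
    return conversations


def build_pairs(lines, conversations):
    """Pair each utterance with the reply that follows it."""
    pairs = []
    for line_ids in conversations:
        for prompt_id, reply_id in zip(line_ids, line_ids[1:]):
            if prompt_id in lines and reply_id in lines:
                pairs.append((lines[prompt_id], lines[reply_id]))
    return pairs


def load_pairs(folder="cornell_movie_dialogs"):
    """Read both files from the dataset folder and build (prompt, reply) pairs."""
    folder = Path(folder)
    # The corpus is not UTF-8; iso-8859-1 decodes every byte without errors.
    with open(folder / "movie_lines.txt", encoding="iso-8859-1") as f:
        lines = parse_lines(f)
    with open(folder / "movie_conversations.txt", encoding="iso-8859-1") as f:
        conversations = parse_conversations(f)
    return build_pairs(lines, conversations)
```

Calling `load_pairs()` from the repository root should return a list of `(prompt, reply)` tuples, which would still need tokenizing and encoding before training.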
Some files are large and tracked using Git LFS.
If you clone this repo, make sure to install Git LFS first:
git lfs install
git clone https://github.com/Surajit00007/Intelligent_Chatbot_Development-project.git