A streamlined Retrieval-Augmented Generation (RAG) chatbot built with Streamlit and LangChain, using DeepSeek models through Groq's API. This application allows users to upload PDF documents and interact with them through natural language queries, featuring step-by-step reasoning and streaming responses.
Demo video: `deepseek-rag-project.mp4`
- PDF Document Processing: Upload and analyze PDF documents
- Step-by-Step Reasoning: Watch the AI think through each question
- Streaming Responses: Real-time response generation
- Advanced Settings: Customize model parameters and processing options
- Performance Metrics: Track response times and processing statistics
- Clean UI: Modern, responsive interface with blue and white theme
- Python 3.8 or higher
- Groq API key
- OpenAI API key (for embeddings)
- Clone the repository:

  ```bash
  git clone https://github.com/Croups/rag-chatbot-deepseek
  cd rag-chatbot-deepseek
  ```

- Create and activate a virtual environment:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Create a `.env` file in the project root:

  ```
  GROQ_API_KEY=your_groq_api_key
  OPENAI_API_KEY=your_openai_api_key
  ```

- Start the Streamlit app:

  ```bash
  streamlit run app.py
  ```

- Open your browser and navigate to `http://localhost:8501`
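Before launching, it can help to confirm both API keys are actually visible to the process. The helper below is a hypothetical sketch (not part of `app.py`); the `missing_keys` function and `REQUIRED` tuple are names introduced here for illustration:

```python
import os

# The two keys the app expects, per the .env setup above
REQUIRED = ("GROQ_API_KEY", "OPENAI_API_KEY")

def missing_keys(env=os.environ):
    """Return the names of required API keys that are unset or empty."""
    return [name for name in REQUIRED if not env.get(name)]
```

Calling `missing_keys()` at startup and printing the result gives a clearer error than a failed API call later.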
```
deepseek-rag-chatbot/
├── app.py              # Main application file
├── requirements.txt    # Project dependencies
├── .env                # Environment variables
├── README.md           # Project documentation
└── document_store/     # Document storage directory
    └── pdfs/           # PDF storage directory
```
- Setup
  - Enter your Groq API key
  - Select a DeepSeek model
  - Configure advanced settings (optional)
- Document Upload
  - Upload a PDF document
  - Documents are stored in `document_store/pdfs/`
  - The system processes and indexes the document
- Chatting
  - Ask questions about your document
  - View the AI's thinking process
  - Get clear, concise answers
- Advanced Settings
  - Temperature: Control response creativity (0.0-1.0)
  - Chunk Size: Adjust text processing (500-2000)
  - Chunk Overlap: Set context overlap (0-500)
- deepseek-r1-distill-qwen-32b: Balanced performance
- deepseek-r1-distill-llama-70b: Best for complex tasks
- llama3-70b-8192: Extended context window
- gemma2-9b-it: Fast and efficient
- Frontend: Streamlit
- RAG Implementation: LangChain
- Embeddings: OpenAI Text Embeddings
- LLM Provider: Groq
- PDF Processing: PDFPlumber
- Text Splitting: RecursiveCharacterTextSplitter
- Document Upload → PDF Processing → Text Chunking
- Chunk Embedding → Vector Storage
- Query Processing → Context Retrieval
- LLM Processing → Streaming Response
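The retrieval step above can be sketched in plain Python. The app itself uses LangChain's vector store with OpenAI embeddings; the `cosine_similarity` and `retrieve` functions below are a simplified stand-in introduced here for illustration, ranking stored chunk vectors against a query vector:

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def retrieve(query_vec, chunk_vecs, k=2):
    """Return indices of the k stored chunks most similar to the query."""
    ranked = sorted(range(len(chunk_vecs)),
                    key=lambda i: cosine_similarity(query_vec, chunk_vecs[i]),
                    reverse=True)
    return ranked[:k]
```

The retrieved chunks are then concatenated into the prompt as context for the LLM.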
| Setting | Range | Default | Description |
|---|---|---|---|
| Temperature | 0.0-1.0 | 0.7 | Controls response randomness |
| Chunk Size | 500-2000 | 1000 | Text chunk size for processing |
| Chunk Overlap | 0-500 | 200 | Overlap between chunks |
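To make the Chunk Size and Chunk Overlap settings concrete: the app uses LangChain's `RecursiveCharacterTextSplitter`, but a simplified sliding-window version (the `chunk_text` function here is illustrative, not the actual implementation) shows how the two parameters interact:

```python
def chunk_text(text, chunk_size=1000, chunk_overlap=200):
    """Split text into chunks of chunk_size characters, where each chunk
    repeats the last chunk_overlap characters of the previous one."""
    step = chunk_size - chunk_overlap  # window advances by 800 at the defaults
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]
```

A larger overlap preserves more context across chunk boundaries at the cost of more chunks (and more embeddings) per document, which is why reducing chunk size helps with memory on large files.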
Common issues and solutions:
- API Key Errors
  - Verify API key length and format
  - Check environment variable configuration
- PDF Processing Issues
  - Ensure the PDF is not password protected
  - Check file permissions
  - Verify the PDF is not corrupted
- Memory Issues
  - Reduce chunk size for large documents
  - Process smaller sections if needed
This project is licensed under the MIT License - see the LICENSE file for details.
Feel free to contact me on LinkedIn: www.linkedin.com/in/enes-koşar