EbroBot is an interactive chatbot built on OpenAI's GPT-4o-mini and a Chroma vector database. It delivers context-aware, high-quality responses grounded in uploaded documents, which serve as its knowledge base, and is designed to simplify human-computer interaction through natural language processing and knowledge retrieval.
The main script, `chatbot.py`, serves as the entry point for the chatbot interface. It:
- Configures and initializes the chatbot's language model (`ChatOpenAI`) and vector database (`Chroma`).
- Sets up file upload functionality to extend the chatbot's knowledge base dynamically.
- Implements a Gradio-based user interface for seamless interaction.
The ingestion script, `ingest_database.py`, prepares the knowledge base by:
- Loading documents (e.g., PDFs) from the `data` directory.
- Splitting the documents into manageable chunks for efficient processing.
- Embedding the chunks using `OpenAIEmbeddings` and storing them in the Chroma vector database.
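The chunking step above can be sketched in plain Python. In the actual ingestion script LangChain's text splitters do this work; the fixed-size, overlapping splitter below (`split_into_chunks` is a hypothetical helper, not the project's code) just illustrates the idea:

```python
def split_into_chunks(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping chunks (illustrative stand-in for
    LangChain's text splitters)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

# A 1200-character document with a 500-character window and 50-character
# overlap yields three chunks; the overlap preserves context at boundaries.
document = "x" * 1200
chunks = split_into_chunks(document, chunk_size=500, overlap=50)
print(len(chunks))
```

Overlap between consecutive chunks keeps sentences that straddle a boundary retrievable from either side.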
- Streamed Responses: Generates partial answers while processing longer queries for a real-time experience.
- Context-Aware Conversations: Leverages conversation history to provide coherent responses.
- Knowledge-Driven Answers: Responds based solely on the knowledge provided by the uploaded documents.
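A minimal sketch of how streamed, history-aware replies could look, with a toy generator standing in for the actual ChatOpenAI streaming call (`stream_answer` and its reply format are illustrative, not the project's code):

```python
from typing import Iterator

def stream_answer(question: str, history: list[tuple[str, str]]) -> Iterator[str]:
    """Toy stand-in for the model call: yields the reply word by word.
    In the real app, ChatOpenAI's streaming API produces the tokens."""
    reply = f"You asked: {question} (turn {len(history) + 1})"
    partial = ""
    for word in reply.split():
        partial = (partial + " " + word).strip()
        yield partial  # a UI such as Gradio can render each partial string as it arrives

history: list[tuple[str, str]] = []
final = ""
for partial in stream_answer("What is EbroBot?", history):
    final = partial
history.append(("What is EbroBot?", final))  # history feeds the next turn
print(final)
```

The history list is what makes later turns context-aware: each new question is answered with the prior question/answer pairs in hand.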
- Users can upload text-based files to dynamically enhance the chatbot's knowledge base.
- Documents are processed, embedded, and stored in the vector database for retrieval.
- Handles large datasets with efficient chunking and embedding strategies.
- Ensures persistence of the vector database for continuous use.
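Under the hood, retrieval amounts to nearest-neighbour search over embedding vectors. The sketch below uses hand-made three-dimensional vectors in place of OpenAIEmbeddings output and a plain list in place of Chroma, purely to illustrate the mechanism:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; higher means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Toy "vector store": (chunk text, embedding) pairs. Real embeddings come
# from OpenAIEmbeddings, and Chroma handles storage and search.
store = [
    ("EbroBot streams partial answers.", [0.9, 0.1, 0.0]),
    ("Documents are split into chunks.", [0.1, 0.8, 0.2]),
    ("The vector database persists to disk.", [0.0, 0.2, 0.9]),
]

def retrieve(query_embedding: list[float], k: int = 1) -> list[str]:
    """Return the k chunks whose embeddings are closest to the query."""
    ranked = sorted(
        store,
        key=lambda item: cosine_similarity(query_embedding, item[1]),
        reverse=True,
    )
    return [text for text, _ in ranked[:k]]

print(retrieve([0.0, 0.3, 0.85]))
```

The retrieved chunks are then handed to the language model as context, which is how answers stay grounded in the uploaded documents.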
Requirements:

- Python 3.8+
- Install dependencies:

  ```shell
  pip install -r requirements.txt
  ```
- Ensure that the OpenAI API key is set in a `.env` file:

  ```
  OPENAI_API_KEY=your_openai_api_key
  ```
To process and store documents in the vector database:

```shell
python ingest_database.py
```

Start the Gradio-based chatbot interface:

```shell
python chatbot.py
```

The chatbot will be accessible at the URL provided by Gradio (e.g., http://127.0.0.1:****).
Project structure:

```
.
├── data/               # Directory for documents to be ingested
├── chroma_db/          # Persistent Chroma vector database
├── chatbot.py          # Main chatbot script
├── ingest_database.py  # Script to ingest documents
├── .env                # Environment variables
└── requirements.txt    # Python dependencies
```
Planned improvements:

- Support for additional file formats (e.g., Word, CSV).
- Advanced conversational memory to handle complex dialogs.
- Deployment on cloud platforms for scalability.
Contributions are welcome! Please follow the standard GitHub workflow:
- Fork the repository.
- Create a feature branch.
- Commit your changes.
- Submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.
Acknowledgments:

- OpenAI for providing the language model API.
- LangChain for the robust integration framework.
- The open-source community for their contributions.