- RAG Engine: FAISS vector database with OpenAI GPT-3.5-turbo
- Sentiment Analysis: RoBERTa-based classification model
- Document Processing: Multi-format ingestion (TXT, MD, PDF, DOCX)
- Session Management: Persistent conversation state
- Escalation System: Automated human handoff triggers
```mermaid
sequenceDiagram
    participant U as User
    participant UI as Streamlit UI
    participant SA as Sentiment Analyzer
    participant RAG as RAG Engine
    participant VS as Vector Store
    participant LLM as OpenAI GPT-3.5
    participant ES as Escalation System

    U->>UI: Submit message
    UI->>SA: Analyze sentiment
    SA-->>UI: Return sentiment score

    alt Negative sentiment > 70%
        UI->>ES: Trigger escalation
        ES-->>UI: Escalation prompt
        UI-->>U: Offer human agent
    else Normal processing
        UI->>RAG: Process query
        RAG->>VS: Retrieve relevant documents
        VS-->>RAG: Return context chunks
        RAG->>LLM: Generate response with context
        LLM-->>RAG: Return AI response
        RAG-->>UI: Formatted response
        UI-->>U: Display response
        alt Low confidence response
            UI->>ES: Trigger escalation
            ES-->>UI: Escalation prompt
            UI-->>U: Offer human agent
        end
    end
```
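For orientation, the routing logic in the diagram can be sketched in Python. This is a minimal illustration, not the exact `process_message()` implementation: `handle_user_message` and the hard-coded limitation phrases are assumptions, while `analyze_sentiment`, `trigger_escalation`, and the chat chain follow the names documented later in this README.

```python
def handle_user_message(message: str, session_id: str, chat_chain) -> str:
    """Route one user message through sentiment analysis, RAG, and escalation."""
    sentiment = analyze_sentiment(message, session_id)

    # Strongly negative sentiment (> 70% confidence) escalates immediately.
    if sentiment["label"] == "negative" and sentiment["score"] > 0.7:
        return trigger_escalation("negative_sentiment", session_id)

    # Otherwise answer via the RAG chain (retriever + prompt + GPT-3.5-turbo).
    response = chat_chain.invoke(
        {"input": message},
        config={"configurable": {"session_id": session_id}},
    )
    answer = response.content if hasattr(response, "content") else str(response)

    # Responses that admit a knowledge gap also trigger escalation.
    if any(phrase in answer.lower() for phrase in ("i don't know", "i'm not sure")):
        return trigger_escalation("low_confidence", session_id)

    return answer
```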
- Python 3.8 or higher
- OpenAI API key
- Git
- Clone repository:
  ```bash
  git clone <repository-url>
  cd grad_project
  ```
- Install dependencies:
  ```bash
  pip install -r requirements.txt
  ```
- Configure environment variables in `.env`:
  ```
  OPENAI_API_KEY=your_api_key_here
  ```
- Create knowledge base directory:
  ```bash
  mkdir -p data/documents
  ```
- Launch application:
  ```bash
  streamlit run clickatell_chatbot_single.py
  ```

| Variable | Description | Required |
|---|---|---|
| `OPENAI_API_KEY` | OpenAI API authentication key | Yes |
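The key is read at startup with python-dotenv (listed in `requirements.txt`). A minimal sketch of that check, mirroring the error shown under Troubleshooting below:

```python
import os

from dotenv import load_dotenv

# Read variables from the .env file into the process environment.
load_dotenv()

api_key = os.getenv("OPENAI_API_KEY")
if not api_key:
    raise ValueError("OPENAI_API_KEY not found in environment variables")
```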
The system uses the following models (configurable in `clickatell_chatbot_single.py`):

```python
EMBEDDING_MODEL = "sentence-transformers/all-MiniLM-L6-v2"
CHAT_MODEL = "gpt-3.5-turbo"
CHUNK_SIZE = 600
CHUNK_OVERLAP = 80
SEARCH_RESULTS = 5
```
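As an illustration only (not the exact code in `clickatell_chatbot_single.py`), these constants would typically drive a LangChain text splitter and the FAISS retriever roughly as follows; the helper names `split_into_chunks` and `build_retriever` are illustrative, not functions from the project.

```python
from langchain.text_splitter import RecursiveCharacterTextSplitter

CHUNK_SIZE = 600
CHUNK_OVERLAP = 80
SEARCH_RESULTS = 5


def split_into_chunks(documents):
    """Split loaded documents into overlapping chunks before indexing."""
    splitter = RecursiveCharacterTextSplitter(
        chunk_size=CHUNK_SIZE,
        chunk_overlap=CHUNK_OVERLAP,
    )
    return splitter.split_documents(documents)


def build_retriever(vector_store):
    """Return the SEARCH_RESULTS most similar chunks per query."""
    return vector_store.as_retriever(search_kwargs={"k": SEARCH_RESULTS})
```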
Sentiment thresholds in `analyze_sentiment()`:

```python
if sentiment["label"] == "negative" and sentiment["score"] > 0.7:
    escalation_reason = "negative_sentiment"
```
```
grad_project/
├── clickatell_chatbot_single.py   # Main application
├── README.md                      # Documentation
├── requirements.txt               # Dependencies
├── .env                           # Environment configuration
├── data/
│   └── documents/                 # Knowledge base files
├── vector_store/                  # FAISS index storage
└── components/
    └── ui/
        └── assets/
            └── logo.png           # Application logo
```
The `load_documents_from_folder()` function handles multi-format document ingestion:

```python
def load_documents_from_folder():
    """Load all supported documents from data/documents folder."""
    # Supports .txt, .md, .pdf, .docx formats
    # Returns list of Document objects with metadata
```
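A hedged sketch of what such a loader can look like, built on the PyPDF2 and docx2txt packages from `requirements.txt`; the actual implementation may use LangChain's document loaders instead, and the exact metadata fields are an assumption.

```python
import os
from typing import List

import docx2txt
from langchain_core.documents import Document
from PyPDF2 import PdfReader

DOCUMENTS_PATH = "data/documents"


def load_documents_from_folder(folder: str = DOCUMENTS_PATH) -> List[Document]:
    """Load .txt, .md, .pdf, and .docx files as LangChain Documents."""
    docs = []
    for name in sorted(os.listdir(folder)):
        path = os.path.join(folder, name)
        ext = os.path.splitext(name)[1].lower()
        if ext in (".txt", ".md"):
            with open(path, encoding="utf-8") as handle:
                text = handle.read()
        elif ext == ".pdf":
            reader = PdfReader(path)
            text = "\n".join(page.extract_text() or "" for page in reader.pages)
        elif ext == ".docx":
            text = docx2txt.process(path)
        else:
            continue  # skip unsupported formats
        docs.append(Document(page_content=text, metadata={"source": name}))
    return docs
```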
The `create_vector_store()` function manages FAISS index creation and loading:

```python
def create_vector_store():
    """Create or load FAISS vector store from documents folder."""
    # Handles index persistence and document chunking
    # Returns configured FAISS store
```
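A minimal create-or-load sketch using LangChain's FAISS wrapper; the paths and embedding model follow the constants above, and the real function may differ in detail. `load_documents_from_folder` and `split_into_chunks` refer to the sketches earlier in this README.

```python
import os

from langchain_community.vectorstores import FAISS
from langchain_huggingface import HuggingFaceEmbeddings

VECTOR_STORE_PATH = "vector_store"
EMBEDDING_MODEL = "sentence-transformers/all-MiniLM-L6-v2"


def create_vector_store():
    """Create the FAISS index from data/documents, or load a persisted one."""
    embeddings = HuggingFaceEmbeddings(model_name=EMBEDDING_MODEL)

    if os.path.isdir(VECTOR_STORE_PATH):
        # Reuse the persisted index; recent langchain-community releases
        # require opting in to deserializing the local pickle.
        return FAISS.load_local(
            VECTOR_STORE_PATH, embeddings, allow_dangerous_deserialization=True
        )

    documents = load_documents_from_folder()  # see the loader sketch above
    chunks = split_into_chunks(documents)     # see the chunking sketch above
    store = FAISS.from_documents(chunks, embeddings)
    store.save_local(VECTOR_STORE_PATH)
    return store
```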
The `create_chat_chain()` function builds the RAG pipeline:

```python
def create_chat_chain(vector_store):
    """Create the conversational RAG chain."""
    # Combines retriever, prompt template, and LLM
    # Returns RunnableWithMessageHistory instance
```
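One way to assemble such a chain with LangChain's expression language and `RunnableWithMessageHistory`; the prompt wording and the in-memory history store are assumptions, not the project's exact code.

```python
from langchain_community.chat_message_histories import ChatMessageHistory
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_core.runnables.history import RunnableWithMessageHistory
from langchain_openai import ChatOpenAI

CHAT_MODEL = "gpt-3.5-turbo"
SEARCH_RESULTS = 5

_histories = {}  # session_id -> ChatMessageHistory


def create_chat_chain(vector_store):
    """Combine retriever, prompt template, and LLM into a session-aware chain."""
    retriever = vector_store.as_retriever(search_kwargs={"k": SEARCH_RESULTS})
    llm = ChatOpenAI(model=CHAT_MODEL, temperature=0)

    prompt = ChatPromptTemplate.from_messages([
        ("system", "Answer the question using only this context:\n\n{context}"),
        MessagesPlaceholder(variable_name="history"),
        ("human", "{input}"),
    ])

    def format_docs(docs):
        return "\n\n".join(doc.page_content for doc in docs)

    # Retrieve context for the question, then pass it with the history to the LLM.
    chain = (
        {
            "context": (lambda x: x["input"]) | retriever | format_docs,
            "input": lambda x: x["input"],
            "history": lambda x: x["history"],
        }
        | prompt
        | llm
    )

    return RunnableWithMessageHistory(
        chain,
        lambda session_id: _histories.setdefault(session_id, ChatMessageHistory()),
        input_messages_key="input",
        history_messages_key="history",
    )
```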
The `analyze_sentiment()` function processes user input:

```python
def analyze_sentiment(text, session_id=None):
    """Analyze sentiment using RoBERTa model."""
    # Returns {"label": str, "score": float}
    # Handles preprocessing for social media text
```
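A hedged sketch using the transformers pipeline API. The specific checkpoint (`cardiffnlp/twitter-roberta-base-sentiment-latest`) is an assumption, chosen because it is a RoBERTa model trained on social media text; the preprocessing shown is likewise illustrative.

```python
from transformers import pipeline

# Assumed checkpoint: a RoBERTa sentiment classifier trained on social media text.
SENTIMENT_MODEL = "cardiffnlp/twitter-roberta-base-sentiment-latest"

_sentiment_pipeline = pipeline("sentiment-analysis", model=SENTIMENT_MODEL)


def analyze_sentiment(text, session_id=None):
    """Return {"label": "positive"|"neutral"|"negative", "score": float}."""
    # Light preprocessing in the style the model was trained on:
    # mask user handles and links before classification.
    cleaned = " ".join(
        "@user" if tok.startswith("@") else "http" if tok.startswith("http") else tok
        for tok in text.split()
    )
    result = _sentiment_pipeline(cleaned[:512])[0]
    return {"label": result["label"].lower(), "score": float(result["score"])}
```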
The `trigger_escalation()` function manages human handoff:

```python
def trigger_escalation(reason, session_id):
    """Generate appropriate escalation message based on trigger reason."""
    # Handles different escalation scenarios
    # Returns formatted escalation prompt
```
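A simple shape for this function, with made-up message wording; the project's actual escalation copy will differ.

```python
ESCALATION_MESSAGES = {
    "negative_sentiment": (
        "I'm sorry this has been frustrating. Would you like me to connect "
        "you with a human agent?"
    ),
    "low_confidence": (
        "I'm not fully confident I can answer that. Would you like to speak "
        "with a human agent?"
    ),
    "processing_error": (
        "Something went wrong on my side. I can hand you over to a human "
        "agent if you'd like."
    ),
}


def trigger_escalation(reason, session_id):
    """Return the escalation prompt matching the trigger reason."""
    # The session_id could be logged here so a human agent can pick up context.
    return ESCALATION_MESSAGES.get(reason, ESCALATION_MESSAGES["processing_error"])
```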
```
streamlit>=1.28.0
langchain>=0.1.0
langchain-community>=0.0.20
langchain-openai>=0.0.5
langchain-huggingface>=0.0.1
faiss-cpu>=1.7.4
transformers>=4.35.0
torch>=2.0.0
python-dotenv>=1.0.0
PyPDF2>=3.0.1
docx2txt>=0.8
```

- Start the application using `streamlit run clickatell_chatbot_single.py`
- Access the interface at `http://localhost:8501`
- Add knowledge base documents to `data/documents/`
- Interact through the chat interface
Supported document formats:
- Text files (.txt, .md)
- PDF documents (.pdf)
- Word documents (.docx)
Documents are automatically processed and indexed on application startup.
The system triggers escalation under these conditions:
- Negative sentiment with confidence > 70%
- AI response contains knowledge limitation indicators
- Processing errors occur
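The first condition is the threshold shown earlier. For the second, a check of roughly this shape would work; the phrase list below is hypothetical, and the real indicators are defined in `clickatell_chatbot_single.py`.

```python
# Hypothetical indicator phrases; the real list lives in the application code.
KNOWLEDGE_LIMITATION_PHRASES = (
    "i don't know",
    "i'm not sure",
    "i don't have information",
    "not covered in the provided context",
)


def has_knowledge_limitation(answer: str) -> bool:
    """Return True when the AI response admits it cannot answer."""
    lowered = answer.lower()
    return any(phrase in lowered for phrase in KNOWLEDGE_LIMITATION_PHRASES)
```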
API Key Error
```
ValueError: OPENAI_API_KEY not found in environment variables
```
Solution: Verify that the `.env` file exists and contains a valid API key.

Document Loading Error
```
No documents found in data/documents folder
```
Solution: Create the `data/documents/` directory and add files in a supported format.

Vector Store Error
```
Failed to load vector store
```
Solution: Delete the `vector_store/` directory to force a rebuild.
- Limit document size for faster processing
- Adjust `CHUNK_SIZE` based on content complexity
- Monitor OpenAI API rate limits and usage
The application follows a modular architecture:
- Configuration: Constants and environment setup
- AI Components: RAG pipeline and sentiment analysis
- UI Components: Streamlit interface elements
- Main Application: Orchestration and message processing
- `initialize_embeddings()`: HuggingFace embedding model setup
- `initialize_sentiment_analyzer()`: RoBERTa sentiment model initialization
- `process_message()`: Main message processing pipeline
- `main()`: Application entry point
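As a hedged sketch of the two initializers listed above, assuming Streamlit's `st.cache_resource` is used so the models load only once per process; the sentiment checkpoint is the same assumption as in the sentiment sketch earlier.

```python
import streamlit as st
from langchain_huggingface import HuggingFaceEmbeddings
from transformers import pipeline

EMBEDDING_MODEL = "sentence-transformers/all-MiniLM-L6-v2"
SENTIMENT_MODEL = "cardiffnlp/twitter-roberta-base-sentiment-latest"  # assumed


@st.cache_resource
def initialize_embeddings():
    """Load the HuggingFace embedding model once per Streamlit process."""
    return HuggingFaceEmbeddings(model_name=EMBEDDING_MODEL)


@st.cache_resource
def initialize_sentiment_analyzer():
    """Load the RoBERTa sentiment pipeline once per Streamlit process."""
    return pipeline("sentiment-analysis", model=SENTIMENT_MODEL)
```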


