This repository contains code for building a chatbot for the College of Artificial Intelligence (CAI) website. Two versions of the chatbot are implemented: one in Arabic using the AraBERT model and the other in English using the BERT model.
The project follows the following steps:
- Load the data.
- Exploratory Data Analysis (EDA) and analysis of the data for insights.
- Split the data into training, testing, and validation datasets.
- Load the BERT pre-trained model and tokenizer.
- Transform the data into numerical format.
- Train the BERT pre-trained model on the custom dataset.
- Evaluate the model.
- Save the model.
- Load the model and start the conversation.
- TensorFlow (2.x).
- Transformers library.
- Flask (for deployment).
To run the code, make sure you have the necessary packages installed:
pip install -U accelerate transformers xformers