What the Youtube Video if not clear https://www.youtube.com/watch?v=VPW6mVTTtTc
git clone https://github.com/oobabooga/text-generation-webui
cd text-generation-webui
pip install -r requirements.txt
python server.py2. Install the OpenAI API extension described in the document
Note: this modules doesn't need to be using OpenAI's API Key. It is just a OpenAI-API-Like module to deploy the LLM as a web-service.
Start the server, and select the model as usual:
python server.py --api --listenafter this you will get
INFO: Waiting for application startup.
INFO: Application startup complete.
INFO: Uvicorn running on http://0.0.0.0:5000 (Press CTRL+C to quit)
Running on local URL: http://0.0.0.0:7860
replace 0.0.0.0 with your elastic ip that you associated with your EC2 instances type the adress http://fill.in.your.own:7860 in to your web browser and download and load the model under the model tag
http://localhost:5000/docs
1. Download Qwen 2.5-7b model
you can download the zip into the MOdel directory of textgenwebui or
with model tag in the textgenwebui page:
Qwen/Qwen2.5-7b
and run Load
Remember to install the openAI API
// bash //python server.py --model open-thoughts_OpenThinker3-7B --api --listen //
python server.py --model Qwen_Qwen2.5-7B --api --listen --sharepython server.py --model Qwen_Qwen2.5-7B --api --listennow if you want to use n8n, you can have
python server.py --model open-thoughts_OpenThinker3-7B --public-api --listenand then paste the link into the HTTP get2