AI transcription without the cloud.
An open-source web app powered by Whisper that lets you transcribe audio and video locally — private, fast, and developer-friendly.
Developed by Brendown Ferreira.
Easily upload local files or paste a YouTube URL, watch a sleek progress overlay, and download your transcript as `.txt`.
Built with FastAPI, Jinja2, Tailwind, and Alpine.js — lightweight, modern, and easy to hack on.
## Features

- 🎥 Transcribe from YouTube URLs or local audio/video files
- ⚡ Two Whisper backends with automatic fallback (see the sketch after this list):
  - `openai-whisper` (preferred when FFmpeg is available)
  - `faster-whisper` (fallback; works even without FFmpeg)
- 🚀 GPU acceleration (CUDA) when available, seamless CPU fallback otherwise
- 🔎 Robust FFmpeg detection (`PATH` and `imageio-ffmpeg`)
- 🌓 Clean UI with dark/light mode, progress bar, and smooth overlay
- 📜 Transcript history with one-click download
- 🧹 Temporary media cleanup after transcription
- ✅ Health check endpoint for quick status
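As a rough illustration of the detection-and-fallback logic described above (a sketch, not the project's actual `app/services/transcriber.py`; function names and the model size are assumptions):

```python
import shutil


def find_ffmpeg() -> str | None:
    """Look for FFmpeg on PATH, then fall back to the imageio-ffmpeg bundled binary."""
    path = shutil.which("ffmpeg")
    if path:
        return path
    try:
        import imageio_ffmpeg
        return imageio_ffmpeg.get_ffmpeg_exe()  # static binary shipped with the wheel
    except Exception:
        return None


def load_backend():
    """Prefer openai-whisper when FFmpeg is usable; otherwise use faster-whisper."""
    import torch
    device = "cuda" if torch.cuda.is_available() else "cpu"
    if find_ffmpeg():
        import whisper  # openai-whisper needs FFmpeg to decode media
        return whisper.load_model("base", device=device)
    # faster-whisper decodes audio itself (via PyAV), so FFmpeg is optional
    from faster_whisper import WhisperModel
    compute_type = "float16" if device == "cuda" else "int8"
    return WhisperModel("base", device=device, compute_type=compute_type)
```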
## Tech Stack

- Backend: FastAPI, Uvicorn
- Frontend: Jinja2 templates, Tailwind (CDN), Alpine.js
- Media & ML: yt-dlp, Whisper (openai-whisper + faster-whisper), PyTorch
- Utilities: python-dotenv, pydantic
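For context on the media side, this is roughly the kind of yt-dlp call a YouTube download service wraps (options here are illustrative, not the project's exact configuration):

```python
import yt_dlp


def download_audio(url: str, out_dir: str = "storage/uploads") -> None:
    """Fetch the best available audio track for a video URL."""
    opts = {
        "format": "bestaudio/best",
        "outtmpl": f"{out_dir}/%(id)s.%(ext)s",  # name files by video id
    }
    with yt_dlp.YoutubeDL(opts) as ydl:
        ydl.download([url])
```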
## Project Structure

```text
app/
├─ main.py               # FastAPI app + router registration
├─ core/
│  ├─ config.py          # Environment + directories
│  └─ theme.py           # Default theme + template globals
├─ routers/
│  ├─ home.py            # GET /
│  ├─ upload.py          # POST /transcribe/*
│  └─ history.py         # GET /history
├─ services/
│  ├─ youtube.py         # YouTube download logic
│  ├─ transcriber.py     # FFmpeg detection + Whisper backends
│  └─ file_manager.py    # File save/load/transcript listing
├─ templates/            # Jinja2 templates (UI)
└─ static/               # CSS, JS, etc.
storage/
├─ uploads/
└─ transcriptions/
tests/                   # End-to-end validation scripts
```
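Wiring-wise, `main.py` might look something like this (a sketch assuming each router module exposes a `router` object; the app title is a guess):

```python
from fastapi import FastAPI
from fastapi.staticfiles import StaticFiles

from app.routers import history, home, upload

app = FastAPI(title="Transcribe Hub")

# Serve static assets and register the routers from app/routers/
app.mount("/static", StaticFiles(directory="app/static"), name="static")
app.include_router(home.router)
app.include_router(upload.router)
app.include_router(history.router)
```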
## Requirements

- Python 3.10+
- FFmpeg (optional but recommended)
- NVIDIA GPU + CUDA (optional, for acceleration)
On Windows, `requirements.txt` pins the CUDA 12.4 wheel index for PyTorch; CPU-only mode works out of the box.
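Such a pin typically looks like the following (a sketch; check `requirements.txt` for the exact lines):

```text
--extra-index-url https://download.pytorch.org/whl/cu124
torch
```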
## Quick Start

Windows (PowerShell):

```powershell
python -m venv .venv
.\.venv\Scripts\Activate.ps1
pip install -r requirements.txt --upgrade --no-cache-dir
python -m uvicorn app.main:app --reload --host 0.0.0.0 --port 8000
```

macOS/Linux:

```bash
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt --upgrade --no-cache-dir
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000
```

Then open: http://localhost:8000
## Docker

```bash
docker build -t transcribe-hub .
docker run --rm -p 8000:8000 transcribe-hub
```

For GPU support, enable the NVIDIA Container Toolkit.
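With the toolkit installed, GPU access is usually granted via Docker's `--gpus` flag:

```bash
docker run --rm --gpus all -p 8000:8000 transcribe-hub
```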
## Usage

- Go to `/` → paste a YouTube URL or upload a file
- Watch the progress overlay
- View the transcript + download as `.txt`
- Visit `/history` to browse past transcripts
## API

Endpoints:

- `GET /` → UI
- `POST /transcribe/youtube` → YouTube → transcript
- `POST /transcribe/upload` → Local file → transcript
- `GET /history` → List transcripts
- `GET /download/{filename}` → Download transcript
- `GET /health` → Health check
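Two quick checks from the command line (the upload form field name `file` is an assumption; adjust to the actual form schema):

```bash
# Quick status
curl http://localhost:8000/health

# Transcribe a local file (field name assumed)
curl -F "file=@sample.mp3" http://localhost:8000/transcribe/upload
```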
## Testing

Run all tests with:

```bash
pytest -q
```

Key tests include:

- `test_imports.py` → ML/media library checks
- `test_download.py` → yt-dlp validation
- `test_transcribe_direct.py` → Direct transcription flow
- `test_call_api.py` → API endpoint validation
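To iterate on a single script, target it directly:

```bash
pytest tests/test_imports.py -q
```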
## Contributing

Contributions are welcome! 🎉 Whether it's bug fixes, features, or docs:

- Fork the repo
- Create a branch (`feature/my-idea`)
- Commit & push
- Open a Pull Request
## Troubleshooting

- FFmpeg not found: install FFmpeg or rely on the `imageio-ffmpeg` fallback
- GPU not used: check CUDA drivers and run `torch.cuda.is_available()` (see the snippet below)
- Large files slow: the progress bar is an estimate; consider SSE/WebSockets for real-time updates
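A quick sanity check from a Python shell:

```python
import torch

print(torch.cuda.is_available())           # True means PyTorch can see a CUDA device
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))   # name of GPU 0
```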
## License

Open Source — see LICENSE.
