DarkHole

A secure, high-performance PDF text extraction tool with a cosmic-inspired interface.

Visit darkhole.org →

Features

DarkHole extracts text from PDFs using multiple fallback methods to ensure maximum accuracy and reliability.

Why Choose DarkHole?

Lightning Fast - Process PDFs in seconds with optimized extraction engines
Secure & Private - Files processed locally with session isolation, automatic cleanup
High Accuracy - Multi-engine approach with OCR fallback for scanned documents

Advanced Technology

Multi-Engine Extraction:

PDFMiner for structural text extraction
PyMuPDF for complex layouts
OCR (Tesseract) for scanned documents

Smart Processing:

Automatic method selection based on PDF type
Resource limits and timeout protection
Comprehensive error handling

Performance Optimizations

Session-based file isolation prevents conflicts
Automatic cleanup of temporary files
Mobile-optimized responsive design
Security hardening with path validation

Quick Start

# Install dependencies
pip install -r requirements.txt

# Run the application
python app.py

Visit http://localhost:5000 to start extracting text from your PDFs.

Tech Stack

Backend: Flask, Python 3.11+
PDF Processing: PDFMiner, PyMuPDF, pdf2image
OCR: Tesseract, pytesseract
Frontend: Vanilla JS, CSS3 with animations
Deployment: Gunicorn, Render-ready

Security Features

Session-based file isolation
Path traversal protection
Input validation and sanitization
Resource limits and timeouts
Automatic temporary file cleanup

Mobile Support

Fully responsive design optimized for mobile devices with touch-friendly interactions and performance optimizations.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
docs/screenshots		docs/screenshots
static		static
templates		templates
.gitignore		.gitignore
.python-version		.python-version
Procfile		Procfile
README.md		README.md
app.py		app.py
cleanup_old_files.py		cleanup_old_files.py
pdf_extractor.py		pdf_extractor.py
render.yaml		render.yaml
requirements.txt		requirements.txt
robots.txt		robots.txt
sitemap.xml		sitemap.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DarkHole

Visit darkhole.org →

Features

Why Choose DarkHole?

Advanced Technology

Performance Optimizations

Quick Start

Tech Stack

Security Features

Mobile Support

About

Uh oh!

Releases

Packages

Languages

rigelshasani/DarkHole

Folders and files

Latest commit

History

Repository files navigation

DarkHole

Visit darkhole.org →

Features

Why Choose DarkHole?

Advanced Technology

Performance Optimizations

Quick Start

Tech Stack

Security Features

Mobile Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages