🚀 Fast. Private. Local.

FileSeek is a lightweight, AI-powered file archive and search tool that helps you organize documents and retrieve them with natural-language queries. It runs entirely on your machine, so your files stay private, and it comes with a cyber-style terminal interface.
- ✅ Smart Search – Natural language search with semantic understanding
- ✅ Similar Document Finding – Discover related documents automatically
- ✅ AI-Powered OCR – Extract text from images and scanned PDFs
- ✅ Local-First – Runs fully offline for complete privacy
- ✅ Zero Config – Works out of the box with sensible defaults
- ✅ Real-time Monitoring – Auto-detects new and modified files
- ⚡ Blazing Fast – Semantic search in milliseconds
- 🔒 Privacy First – No cloud, no data sharing, fully local
- 🤖 AI-Powered – Advanced OCR and semantic understanding
- 🪶 Lightweight – Minimal dependencies, smooth performance
- 💻 Developer-Friendly – Clean CLI with rich terminal UI
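
This README does not say how FileSeek implements semantic search or similar-document finding. As a rough illustration of the general idea behind both features, the sketch below ranks documents by cosine similarity between vectors. The `embed` function is a toy word-count stand-in (an assumption for illustration only); a real system would use a learned embedding model so that related wording also matches.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, docs):
    """Return document names ordered from most to least similar to the query."""
    q = embed(query)
    scored = [(cosine(q, embed(text)), name) for name, text in docs.items()]
    return [name for _, name in sorted(scored, reverse=True)]


# Hypothetical in-memory "archive": file name -> extracted text.
docs = {
    "ml_notes.txt": "notes on machine learning and neural networks",
    "recipe.txt": "pasta recipe with tomato sauce",
    "budget.txt": "monthly budget and expenses",
}

print(rank("machine learning notes", docs))
```

Here `ml_notes.txt` ranks first because it shares the most terms with the query. Finding similar documents works the same way: pass the text of a reference file as the query.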
Make sure you have these system packages installed:

Ubuntu/Debian:

    sudo apt-get install tesseract-ocr poppler-utils libmagic1

Fedora:

    sudo dnf install tesseract poppler-utils file-libs

macOS:

    brew install tesseract poppler libmagic

Windows:

    # Using Chocolatey (Run as Administrator)
    choco install tesseract poppler libmagic

Add documents to the archive:

    fileseek process -r /path/to/documents

Supports: PDFs, text files, images (with OCR), and scanned documents
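
To make the recursive option concrete, here is a minimal sketch of how a tool could collect candidate files under a directory. It is not FileSeek's code: the `SUPPORTED` extension set and the `find_documents` helper are hypothetical names chosen for illustration, and the real list of supported types may differ.

```python
from pathlib import Path

# Hypothetical extension list; FileSeek's real set of supported types may differ.
SUPPORTED = {".pdf", ".txt", ".md", ".png", ".jpg", ".jpeg", ".tiff"}


def find_documents(root, recursive=True):
    """Return supported files under root, descending into subfolders if recursive."""
    root = Path(root).expanduser()
    candidates = root.rglob("*") if recursive else root.glob("*")
    return sorted(
        p for p in candidates
        if p.is_file() and p.suffix.lower() in SUPPORTED
    )
```

Calling `find_documents("/path/to/documents")` walks the whole tree, while `recursive=False` only looks at the top level. Matching on the lowercased suffix means `SCAN.PNG` and `scan.png` are treated the same.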
Find documents using natural language:

    fileseek search "find my notes on machine learning"

Discover documents similar to a reference file:

    fileseek similar /path/to/reference/file

Automatically process new, modified, or deleted files:

    fileseek watch /path/to/watch

View all archived documents:

    fileseek list

Install FileSeek from source:

    git clone https://github.com/yourusername/fileseek.git
    cd fileseek
    pip install -e .

Now you can use all commands directly:

    fileseek process ~/Documents
    fileseek search "find my course note on machine learning"
    fileseek similar ~/Documents/project_plan.pdf
    fileseek watch ~/Documents
    fileseek list
    fileseek validate

FileSeek is zero-config by default but highly customizable:

    # Set custom storage location
    fileseek config set storage_path=~/FileSeekData
    # Enable debug logging
    fileseek config set logging.level=DEBUG
    # Set OCR languages (ISO 639-2 codes)
    # Quote the argument so a POSIX shell passes the brackets and inner quotes through literally
    fileseek config set 'ocr.languages=["eng","fra"]'

Key Configuration Options:

- storage_path: Where to store the document index
- ocr.languages: Languages for OCR processing
- search.max_results: Maximum number of search results
- monitoring.watch_interval: File monitoring frequency
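
The option names above use dotted keys such as `ocr.languages`. One plausible way to handle `key=value` arguments like these is to split on the first `=`, decode the value as JSON when it parses, and store it in a nested dictionary. The sketch below shows that approach; `parse_assignment` and `set_dotted` are hypothetical helpers, the value `20` for `search.max_results` is made up for the example, and FileSeek's real config handling may work differently.

```python
import json


def parse_assignment(arg):
    """Split 'key=value' and decode the value as JSON when possible."""
    key, sep, raw = arg.partition("=")
    if not sep or not key:
        raise ValueError(f"expected key=value, got {arg!r}")
    try:
        value = json.loads(raw)
    except json.JSONDecodeError:
        value = raw  # plain strings such as DEBUG or a path stay as text
    return key, value


def set_dotted(config, dotted_key, value):
    """Store value under a dotted key, creating nested dicts as needed."""
    node = config
    *parents, leaf = dotted_key.split(".")
    for part in parents:
        node = node.setdefault(part, {})
    node[leaf] = value
    return config


config = {}
for arg in [
    "storage_path=~/FileSeekData",
    "logging.level=DEBUG",
    'ocr.languages=["eng","fra"]',
    "search.max_results=20",  # illustrative value, not a documented default
]:
    key, value = parse_assignment(arg)
    set_dotted(config, key, value)

print(config)
```

Decoding values as JSON is what lets a list such as `["eng","fra"]` arrive as a real list instead of a string, which is also why the argument needs quoting on the command line.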