Skip to content

uxdreaming/Text-extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📸 Text-Extractor (Linux)

A lightning-fast, PowerToys-inspired text extractor for Linux. Capture any area of your screen and instantly copy the text within to your clipboard.

Built specifically for Linux users using Flameshot for selection and Tesseract OCR for recognition.

License Python Platform

✨ Features

  • Seamless Selection: Uses the powerful Flameshot GUI for precision cropping.
  • Fast OCR: Powered by Tesseract OCR engine.
  • Clipboard Integration: Automatically copies extracted text—ready to paste anywhere.
  • Minimalist: No heavy GUI, just functionality. No annoying notifications.

🚀 Installation

1. Requirements

Ensure you have the following system dependencies installed:

# Ubuntu/Debian
sudo apt install tesseract-ocr flameshot xclip

# Arch Linux
sudo pacman -S tesseract flameshot xclip

# Fedora
sudo dnf install tesseract flameshot xclip

2. Setup

Clone this repository and set up the environment:

git clone https://github.com/YOUR_USERNAME/Text-extractor.git
cd Text-extractor
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

⌨️ Usage

Run the script manually:

./venv/bin/python ocr_extractor.py

Integration with i3wm

Add this line to your ~/.config/i3/config:

bindsym $mod+less exec --no-startup-id /path/to/venv/bin/python /path/to/ocr_extractor.py

🛠️ Configuration

The script uses Tesseract's default language (English). To add more languages (e.g., Spanish):

  1. Install the language pack: sudo apt install tesseract-ocr-spa
  2. Modify ocr_extractor.py:
    text = pytesseract.image_to_string(image, lang='eng+spa')

📜 License

Distributed under the MIT License. See LICENSE for more information.


Maintained by uxdreaming

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages