image/PDF_scanner #8

livnugaraa · 2025-09-05T01:37:18Z

This PR introduces functionality for scanning media files, extracting text content using OCR, and managing file operations more robustly. It integrates three new/updated modules:

file_handler.py: Handles file input/output operations, validation, and error management.
ocr_engine.py: Provides OCR capabilities for extracting text from images and other supported formats.
scan_media.py: Implements the core logic for scanning media files, coordinating between file handling and OCR processing.
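As a rough illustration of the kind of validation file_handler.py is described as doing, here is a minimal sketch. The names `SUPPORTED_EXTENSIONS`, `UnsupportedMediaError`, and `validate_media_path` are hypothetical, and the extension set is an assumption; this PR description does not list the supported formats or the module's actual API.

```python
from pathlib import Path

# Assumed set of supported extensions; the PR description does not list them.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".pdf"}


class UnsupportedMediaError(ValueError):
    """Hypothetical error for files outside the assumed supported set."""


def validate_media_path(path):
    """Return a Path if the file exists and has a supported extension."""
    p = Path(path)
    if not p.is_file():
        raise FileNotFoundError(f"No such file: {p}")
    # Compare case-insensitively so "scan.PDF" is treated like "scan.pdf".
    if p.suffix.lower() not in SUPPORTED_EXTENSIONS:
        raise UnsupportedMediaError(f"Unsupported media type: {p.suffix or '(none)'}")
    return p
```

Checking the path up front means that a missing or unsupported file is reported before any OCR work starts.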

Key Changes

Implemented file handling utilities to safely load, save, and validate media files.
Added an OCR engine wrapper that abstracts text extraction and handles failures gracefully.
Built a media scanning pipeline that:
    Accepts image and other supported media types.
    Uses ocr_engine to extract text content.
    Provides structured results for downstream processing.
Improved error handling and logging for debugging and maintainability.
Modularised code to make each component reusable and testable.
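The pipeline described above can be sketched roughly as follows. This is not the code in scan_media.py: `ScanResult`, `scan_media`, and the `extract_text` parameter are hypothetical names, and the OCR backend is passed in as a plain callable because the description does not say which OCR library ocr_engine.py wraps.

```python
import logging
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Optional

logger = logging.getLogger("scan_media_sketch")


@dataclass
class ScanResult:
    """Hypothetical structured result handed to downstream processing."""
    path: str
    ok: bool
    text: str = ""
    error: Optional[str] = None


def scan_media(path, extract_text: Callable[[Path], str]) -> ScanResult:
    """Run the supplied OCR callable on one file and wrap the outcome."""
    p = Path(path)
    try:
        text = extract_text(p)
    except Exception as exc:
        # Fail gracefully: log the problem and report it in the result
        # instead of letting the exception stop a batch of scans.
        logger.warning("OCR failed for %s: %s", p, exc)
        return ScanResult(path=str(p), ok=False, error=str(exc))
    return ScanResult(path=str(p), ok=True, text=text.strip())
```

Injecting the OCR step as a callable keeps the sketch testable without an OCR engine installed: a stub such as `lambda p: "hello"` stands in for the real extractor.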

This scans PDFs and images and converts them to text.

lperry022

Great work!

ben-AI-cybersec

LGTM too

ben-AI-cybersec

looks great, thanks Liv!

image/PDF_scanner

f4f9f21

This scans PDFs and images and converts them to text.

lperry022 approved these changes Sep 15, 2025

ben-AI-cybersec approved these changes Sep 18, 2025

lperry022 assigned lperry022 and unassigned lperry022 Sep 22, 2025

ben-AI-cybersec approved these changes Nov 18, 2025

3 participants
