🎙️ Comic-to-Audio Converter

Convert your comic book panels into speech using OCR and Text-to-Speech (TTS) technology!

This project automatically detects panels in a comic image, extracts text using EasyOCR, and generates corresponding audio using gTTS. Finally, it merges the panel audios into a single voiceover file for a seamless listening experience.

📌 Features

📖 Comic Panel Detection
Automatically splits comic pages into individual panels using image processing.
🔍 Text Extraction
Extracts English text from each panel using EasyOCR.
🎤 Text-to-Speech
Converts extracted text to audio using Google Text-to-Speech (gTTS).
🎧 Audio Compilation
Combines all panel audio files into one, with pauses between them.

🛠️ Installation

Ensure you're using a Python environment like Google Colab or Jupyter Notebook. Then install the dependencies:

pip install easyocr opencv-python numpy matplotlib gTTS pydub

Also, install FFmpeg for audio processing via pydub. In Google Colab, run:

!apt install ffmpeg

🚀 How to Use

➕ Add Your Comic Image

Place your comic image in the working directory.
Update the path in the code:

image_path = "Comic3.jpg"

▶️ Run the Main Process

This will:

✅ Detect comic panels
✅ Extract text from each panel
✅ Generate TTS for each panel
✅ Save panel audios
✅ Merge them into a single audio file

🧠 How It Works

🖼️ Panel Detection

Converts comic image to binary using thresholding.
Identifies white spaces to segment panels.

👁️ OCR

Uses EasyOCR to extract text from each panel image.

🔊 TTS

Uses Google Text-to-Speech (gTTS) to convert text to MP3 files.

🎚️ Audio Merging

Uses pydub to concatenate all MP3s with short pauses in between.

📦 Dependencies

easyocr
opencv-python
numpy
matplotlib
gTTS
pydub

System Dependency:

ffmpeg (external system dependency)

📄 License

This project is licensed under the MIT License.

🙌 Acknowledgments

Comic images used are for demonstration purposes only.
OCR by EasyOCR
TTS powered by gTTS

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Comic Panel.jpg		Comic Panel.jpg
Comic_Studies_Lab.ipynb		Comic_Studies_Lab.ipynb
README.md		README.md
combined_comic_audio.mp3		combined_comic_audio.mp3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Comic-to-Audio Converter

📌 Features

🛠️ Installation

🚀 How to Use

➕ Add Your Comic Image

▶️ Run the Main Process

🧠 How It Works

🖼️ Panel Detection

👁️ OCR

🔊 TTS

🎚️ Audio Merging

📦 Dependencies

📄 License

🙌 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

Mayank471/image2speech-comics

Folders and files

Latest commit

History

Repository files navigation

🎙️ Comic-to-Audio Converter

📌 Features

🛠️ Installation

🚀 How to Use

➕ Add Your Comic Image

▶️ Run the Main Process

🧠 How It Works

🖼️ Panel Detection

👁️ OCR

🔊 TTS

🎚️ Audio Merging

📦 Dependencies

📄 License

🙌 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages