Talk to Me

Welcome to the "Talk to Me" repository, a project designed to seamlessly integrate audio recording, speech recognition, text generation, and text-to-speech conversion. This software provides a dynamic and interactive audio experience.

Key Features

Audio Recording: Capture high-quality audio using PyAudio.
Speech Recognition: Convert recorded audio to text with Google's Speech Recognition API.
Text Generation: Utilize Google Generative AI to process and generate text content.
Text-to-Speech Conversion: Convert generated text to speech and save it as an MP3 file using gTTS.
Audio Playback: Play the generated audio using Pygame.

Technologies Used

PyAudio: For capturing and handling audio data.
Wave: To save recorded audio in WAV format.
SpeechRecognition: For converting audio to text using Google's Speech Recognition API.
gTTS: For converting text to speech and saving as an MP3 file.
Pygame: To play audio files.
Google Generative AI: Leverage advanced generative AI models for content generation.

How to Use

Clone the repository:

git clone https://github.com/Ankit1017/TalkToMe.git

The main script captures audio, converts it to text, processes the text with generative AI, and converts it back to audio. If the phrase "OK Google" is detected in the audio, the process halts; otherwise, it continues with further processing and playback.

Contributing We welcome contributions! Please feel free to submit issues and pull requests to improve this project.

License This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
build/app1		build/app1
icons		icons
uploads		uploads
LICENSE		LICENSE
README.md		README.md
app.spec		app.spec
main v1.py		main v1.py
main.py		main.py
output_1722316630.mp3		output_1722316630.mp3
output_1722316677.mp3		output_1722316677.mp3
output_1722316929.mp3		output_1722316929.mp3
output_1722317127.mp3		output_1722317127.mp3
output_1722322379.mp3		output_1722322379.mp3
output_1722333837.mp3		output_1722333837.mp3
output_1722334185.mp3		output_1722334185.mp3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Talk to Me

Key Features

Technologies Used

How to Use

About

Uh oh!

Releases

Packages

Languages

License

Ankit1017/TalkToMe

Folders and files

Latest commit

History

Repository files navigation

Talk to Me

Key Features

Technologies Used

How to Use

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages