Voice Navigator

An AI-powered voice assistant designed to help motor-impaired individuals perform digital tasks through natural voice commands.

Overview

Voice Navigator is an accessibility-focused application that enables users with motor impairments to control their computer and perform various digital tasks using only their voice. The system leverages advanced speech recognition, natural language understanding, and text-to-speech technologies to provide a seamless hands-free computing experience.

Features

Voice-Activated Control: Perform tasks using natural voice commands
Speech Recognition: Powered by Whisper for accurate transcription
Natural Language Understanding: Intelligent intent parsing to understand user commands
Text-to-Speech Feedback: Audible responses to confirm actions and provide feedback
System Automation: Control various system functions hands-free
Web Automation: Navigate and interact with web content using voice commands

Architecture

The project is organized into modular components:

STT (Speech-to-Text): Captures audio input and transcribes speech using Whisper
NLU (Natural Language Understanding): Parses user intent from transcribed text
Automation: Executes system commands and web automation tasks
TTS (Text-to-Speech): Provides audio feedback to the user
Config: Centralized configuration and settings management

Installation

Clone the repository:

git clone https://github.com/andvvo/voice-navigator.git
cd voice-navigator

Install dependencies:

pip install -r requirements.txt

Usage

Run the voice assistant:

python src/main.py

Speak your commands naturally, and the assistant will transcribe, interpret, and execute them while providing audio feedback.

Requirements

See requirements.txt for the full list of dependencies.

Project Structure

src/
├── main.py              # Main application entry point
├── stt/                 # Speech-to-text module
├── nlu/                 # Natural language understanding
├── automation/          # Task automation and execution
├── tts/                 # Text-to-speech feedback
└── config/              # Configuration management

Accessibility

This project is specifically designed to improve digital accessibility for individuals with motor impairments, enabling them to:

Navigate their computer without physical input devices
Execute complex tasks through simple voice commands
Receive audio confirmation of actions
Maintain independence in digital environments

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Voice Navigator

Overview

Features

Architecture

Installation

Usage

Requirements

Project Structure

Accessibility

About

Uh oh!

Releases

Packages

Languages

andvvo/voice-navigator

Folders and files

Latest commit

History

Repository files navigation

Voice Navigator

Overview

Features

Architecture

Installation

Usage

Requirements

Project Structure

Accessibility

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages