A free, open source, and extensible speech-to-text application that works completely offline.
Meetdy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides simple, privacy-focused speech transcription. Press a shortcut, speak, and have your words appear in any text field—all without sending your voice to the cloud.
- Free: Accessibility tooling belongs in everyone's hands, not behind a paywall
- Open Source: Together we can build further. Extend Meetdy for yourself and contribute to something bigger
- Private: Your voice stays on your computer. Get transcriptions without sending audio to the cloud
- Simple: One tool, one job. Transcribe what you say and put it into a text box
- Press a configurable keyboard shortcut to start/stop recording (or use push-to-talk mode)
- Speak your words while the shortcut is active
- Release and Meetdy processes your speech using Whisper
- Get your transcribed text pasted directly into whatever app you're using
The process is entirely local:
- Silence is filtered using VAD (Voice Activity Detection) with Silero
- Transcription uses your choice of models:
- Whisper models (Small/Medium/Turbo/Large) with GPU acceleration when available
- Parakeet V3 - CPU-optimized model with excellent performance and automatic language detection
- Works on Windows, macOS, and Linux
- Download the latest release from the releases page
- Install the application following platform-specific instructions
- Launch Meetdy and grant necessary system permissions (microphone, accessibility)
- Configure your preferred keyboard shortcuts in Settings
- Start transcribing!
For detailed build instructions including platform-specific requirements, see BUILD.md.
Prerequisites:
# Install dependencies
bun install
# Run in development mode
bun run tauri dev
# Build for production
bun run tauri buildMeetdy is built as a Tauri application combining:
- Frontend: React + TypeScript with Tailwind CSS for the settings UI
- Backend: Rust for system integration, audio processing, and ML inference
- Core Libraries:
whisper-rs: Local speech recognition with Whisper modelstranscription-rs: CPU-optimized speech recognition with Parakeet modelscpal: Cross-platform audio I/Ovad-rs: Voice Activity Detectionrdev: Global keyboard shortcuts and system eventsrubato: Audio resampling
Meetdy includes an advanced debug mode for development and troubleshooting. Access it by pressing:
- macOS:
Cmd+Shift+D - Windows/Linux:
Ctrl+Shift+D
- macOS (both Intel and Apple Silicon)
- x64 Windows
- x64 Linux
For Whisper Models:
- macOS: M series Mac, Intel Mac
- Windows: Intel, AMD, or NVIDIA GPU
- Linux: Intel, AMD, or NVIDIA GPU (Ubuntu 22.04, 24.04)
For Parakeet V3 Model:
- CPU-only operation - runs on a wide variety of hardware
- Minimum: Intel Skylake (6th gen) or equivalent AMD processors
- Performance: ~5x real-time speed on mid-range hardware
- Check existing issues at github.com/chaudl113/meetdy/issues
- Fork the repository and create a feature branch
- Test thoroughly on your target platform
- Submit a pull request with clear description of changes
MIT License - see LICENSE file for details.
- Whisper by OpenAI for the speech recognition model
- whisper.cpp and ggml for amazing cross-platform whisper inference/acceleration
- Silero for great lightweight VAD
- Tauri team for the excellent Rust-based app framework
- Handy - The original project this was forked from
"Your search for the right speech-to-text tool can end here—not because Meetdy is perfect, but because you can make it perfect for you."