Add voice-assistant support & TUI demo #55

graniet · 2025-06-15T17:34:54Z

Adds first-class audio handling to the LLM stack and a runnable voice-assistant example:

New MessageType::Audio with helpers (has_audio, audio_data) and a cond!(has_audio) macro.
AgentBuilder / ChatWithMemory now accept an optional STT provider and launch an async transcription pipeline using Whisper.
Back-end updates (OpenAI, Anthropic, Google, Azure) gracefully ignore audio when building API payloads.
Memory tweaks: audio messages are filtered out of sliding-window recall to keep prompts clean.
Example agent_audio_example.rs: records PCM via cpal, encodes WAV with hound, shows live conversation in a ratatui/crossterm TUI, and drives a two-agent (transcriber → assistant) workflow.
Feature-gated dependencies (cpal, hound, ratatui, crossterm, anyhow, parking_lot) under audio-example to avoid bloat for non-audio builds.
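A minimal sketch of how the audio message type and its helpers might fit together. The type and field names here (`ChatMessage`, `message_type`, `payload_texts`) are simplified stand-ins chosen for illustration, not the crate's actual definitions; only `MessageType::Audio`, `has_audio`, and `audio_data` are named in this PR.

```rust
// Hypothetical, simplified stand-ins for the crate's message types.
// The real definitions in src/chat/mod.rs may differ.
#[derive(Debug, Clone, PartialEq)]
pub enum MessageType {
    Text,
    /// Encoded audio bytes, for example a WAV file.
    Audio(Vec<u8>),
}

#[derive(Debug, Clone, PartialEq)]
pub struct ChatMessage {
    pub content: String,
    pub message_type: MessageType,
}

impl ChatMessage {
    pub fn text(content: &str) -> Self {
        ChatMessage {
            content: content.to_string(),
            message_type: MessageType::Text,
        }
    }

    pub fn audio(bytes: Vec<u8>) -> Self {
        ChatMessage {
            content: String::new(),
            message_type: MessageType::Audio(bytes),
        }
    }

    /// True when the message carries an audio payload.
    pub fn has_audio(&self) -> bool {
        matches!(self.message_type, MessageType::Audio(_))
    }

    /// Borrow the audio bytes, if the message has any.
    pub fn audio_data(&self) -> Option<&[u8]> {
        match &self.message_type {
            MessageType::Audio(bytes) => Some(bytes.as_slice()),
            MessageType::Text => None,
        }
    }
}

/// Assumed shape of "gracefully ignoring audio": a text-only backend
/// skips audio messages when collecting what goes into the API payload.
pub fn payload_texts(messages: &[ChatMessage]) -> Vec<&str> {
    messages
        .iter()
        .filter(|m| !m.has_audio())
        .map(|m| m.content.as_str())
        .collect()
}
```

Under these assumptions, a conversation containing one text message and one audio message would yield a payload with only the text entry, while `audio_data` still exposes the raw bytes to an STT provider.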

reneleonhardt · 2025-06-19T10:08:38Z

Wow, great work @graniet 🚀
What is missing? Could a CI workflow be added to check for regressions?

Copilot

Pull Request Overview

This PR adds voice-assistant support and a terminal UI demo to the LLM stack. Key changes include:

Introducing a new audio message type with helper methods and a cond!(has_audio) macro.
Enhancing memory and chat wrappers to filter out and transcribe audio messages by integrating an optional STT provider.
Adding a runnable TUI-based voice-assistant example along with feature-gated audio dependencies.

Reviewed Changes

Copilot reviewed 13 out of 14 changed files in this pull request and generated 2 comments.

File	Description
src/memory/sliding_window.rs	Filters out audio messages from the sliding window memory recall.
src/memory/mod.rs & cond_macros.rs	Adds the new HasAudio condition and helper documentation.
src/memory/chat_wrapper.rs	Integrates an STT provider to spawn a transcription pipeline for audio messages.
src/chat/mod.rs	Introduces helper methods for working with the new Audio message type.
src/builder.rs; src/agent/builder.rs	Updates the LLM and agent builders to accept an optional STT provider.
src/backends/*	Updates backend implementations to gracefully ignore audio messages.
examples/agent_audio_example.rs	Provides a complete demo integrating recording, transcription, audio processing, and terminal UI.
Cargo.toml	Adds optional dependencies for audio processing and the TUI example.
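The transcription pipeline described for src/memory/chat_wrapper.rs can be pictured roughly as follows. The PR spawns an async pipeline; this sketch swaps in a standard-library thread and channel so it stays self-contained. `SttProvider`, `AudioEvent`, `Memory`, and `spawn_transcriber` are hypothetical names for illustration, not the crate's API.

```rust
use std::sync::mpsc;
use std::sync::{Arc, RwLock};
use std::thread;

/// Hypothetical speech-to-text interface; the crate's real trait may differ.
pub trait SttProvider: Send + 'static {
    fn transcribe(&self, audio: &[u8]) -> Result<String, String>;
}

/// An audio message waiting to be transcribed, tagged with its chat role.
pub struct AudioEvent {
    pub role: String,
    pub audio: Vec<u8>,
}

/// Stand-in for shared conversation memory: (role, text) pairs.
pub type Memory = Arc<RwLock<Vec<(String, String)>>>;

/// Spawn a worker that transcribes incoming audio and stores the text.
/// The worker exits once every `Sender` has been dropped.
pub fn spawn_transcriber<P: SttProvider>(
    stt: P,
    memory: Memory,
) -> (mpsc::Sender<AudioEvent>, thread::JoinHandle<()>) {
    let (tx, rx) = mpsc::channel::<AudioEvent>();
    let handle = thread::spawn(move || {
        for event in rx {
            match stt.transcribe(&event.audio) {
                Ok(text) => {
                    // Only the transcript is remembered, never the raw audio.
                    memory
                        .write()
                        .expect("memory lock poisoned")
                        .push((event.role, text));
                }
                Err(e) => eprintln!("STT error: {}", e),
            }
        }
    });
    (tx, handle)
}
```

A caller would send an `AudioEvent` for each recorded utterance and read the transcripts back from `Memory`; a failed transcription is reported and skipped rather than stopping the worker.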

Comments suppressed due to low confidence (1)

src/memory/sliding_window.rs:182

Consider adding a comment to explain why audio messages are filtered out in the sliding window recall, to clarify the intent for future maintainers.

        messages.retain(|m| !m.has_audio());
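One way the suggested comment could look in context, shown on a simplified sliding-window memory. The `Message` and `SlidingWindow` types below are illustrative assumptions and do not mirror src/memory/sliding_window.rs; the stated rationale comes from the PR description ("to keep prompts clean").

```rust
use std::collections::VecDeque;

// Hypothetical minimal message type, used only for this sketch.
#[derive(Debug, Clone, PartialEq)]
pub struct Message {
    pub text: String,
    pub audio: Option<Vec<u8>>,
}

impl Message {
    pub fn has_audio(&self) -> bool {
        self.audio.is_some()
    }
}

/// Keeps at most `window` of the most recent messages.
pub struct SlidingWindow {
    window: usize,
    messages: VecDeque<Message>,
}

impl SlidingWindow {
    pub fn new(window: usize) -> Self {
        SlidingWindow {
            window: window.max(1), // avoid a zero-sized window
            messages: VecDeque::new(),
        }
    }

    pub fn remember(&mut self, message: Message) {
        if self.messages.len() == self.window {
            self.messages.pop_front(); // evict the oldest entry
        }
        self.messages.push_back(message);
    }

    pub fn recall(&self) -> Vec<Message> {
        let mut messages: Vec<Message> = self.messages.iter().cloned().collect();
        // Audio messages hold raw bytes that text prompts cannot use.
        // Drop them here so recalled context stays clean; the STT
        // pipeline stores the transcript as a separate text message.
        messages.retain(|m| !m.has_audio());
        messages
    }
}
```

Note that in this sketch audio messages still occupy slots in the window before being filtered at recall time; whether the real implementation behaves the same way is not stated in the PR.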

Copilot · 2025-06-21T18:26:05Z

src/memory/chat_wrapper.rs

+
+                            let mut guard = memory.write().await;
+                            if let Err(e) = guard.remember_with_role(&transcribed_msg, event.role.clone()).await {
+                                eprintln!("STT memory save error: {}", e);


Consider replacing eprintln! with a proper logging framework so that errors are reported consistently across production code.

Copilot · 2025-06-21T18:26:06Z

examples/agent_audio_example.rs

+                                        writer.write_sample(s).unwrap();
+                                    }
+                                    writer.finalize().unwrap();


Consider handling errors explicitly when writing WAV samples instead of using unwrap to prevent potential panics during audio processing.

Suggested change

-                                        writer.write_sample(s).unwrap();
-                                    }
-                                    writer.finalize().unwrap();
+                                        if let Err(e) = writer.write_sample(s) {
+                                            eprintln!("Error writing WAV sample: {e}");
+                                            return Vec::new(); // Return an empty buffer on error
+                                        }
+                                    }
+                                    if let Err(e) = writer.finalize() {
+                                        eprintln!("Error finalizing WAV file: {e}");
+                                        return Vec::new(); // Return an empty buffer on error
+                                    }
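An alternative to returning an empty buffer is to propagate the error with `?` and let the caller decide what to do. The example in this PR encodes WAV with hound; the sketch below deliberately swaps in a small standard-library encoder for 16-bit PCM so it stays self-contained. The function name `encode_wav_i16` and its parameters are assumptions for illustration.

```rust
use std::io::{self, Write};

fn invalid(msg: &str) -> io::Error {
    io::Error::new(io::ErrorKind::InvalidInput, msg.to_string())
}

/// Encode interleaved 16-bit PCM samples as a WAV byte buffer.
/// Errors are returned to the caller instead of panicking.
pub fn encode_wav_i16(samples: &[i16], sample_rate: u32, channels: u16) -> io::Result<Vec<u8>> {
    if channels == 0 {
        return Err(invalid("channel count must be at least 1"));
    }
    let bits_per_sample: u16 = 16;
    let block_align = channels
        .checked_mul(bits_per_sample / 8)
        .ok_or_else(|| invalid("too many channels"))?;
    let byte_rate = sample_rate
        .checked_mul(u32::from(block_align))
        .ok_or_else(|| invalid("byte rate overflows u32"))?;
    // The RIFF size field is 32-bit, so the data must fit with the header.
    let data_bytes = samples
        .len()
        .checked_mul(2)
        .filter(|n| *n <= (u32::MAX - 36) as usize)
        .ok_or_else(|| invalid("too many samples for a WAV file"))?;
    let data_len = data_bytes as u32;

    let mut out = Vec::with_capacity(44 + data_bytes);
    out.write_all(b"RIFF")?;
    out.write_all(&(36 + data_len).to_le_bytes())?; // remaining file size
    out.write_all(b"WAVE")?;
    out.write_all(b"fmt ")?;
    out.write_all(&16u32.to_le_bytes())?; // size of the fmt chunk
    out.write_all(&1u16.to_le_bytes())?; // format tag 1 = PCM
    out.write_all(&channels.to_le_bytes())?;
    out.write_all(&sample_rate.to_le_bytes())?;
    out.write_all(&byte_rate.to_le_bytes())?;
    out.write_all(&block_align.to_le_bytes())?;
    out.write_all(&bits_per_sample.to_le_bytes())?;
    out.write_all(b"data")?;
    out.write_all(&data_len.to_le_bytes())?;
    for s in samples {
        out.write_all(&s.to_le_bytes())?;
    }
    Ok(out)
}
```

With a `Result` return type, the recording loop can log the error and skip the utterance, or surface it in the TUI, rather than silently passing an empty buffer to the transcriber.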

feat(audio): Add audio example with voice assistant functionality

49d4d4d

graniet changed the title ~~Add speech to text feature for agent builder~~ Add voice-assistant support & TUI demo Jun 15, 2025

Merge branch 'main' into feat/chatmessage-audio

0a5546e

graniet requested a review from Copilot June 21, 2025 18:25

Copilot AI reviewed Jun 21, 2025
