Added `PyAVResampler` - a stateful resampler implementation #206

dangusev · 2025-12-20T12:59:21Z

Added a stateful resampler based on pyav

It is intended to be once for the audio track and reused.

Key differences from the stateless implementation:

It buffers samples internally, so the number of output samples doesn't always match the input.
PyAVResampler keeps its own monotonic PTS clock, and it's meant to be used with a single audio stream only.
It ignores the PTS/DTS from PcmData.
PTS always starts from 0 for the first output.
pyav.AudioResampler configures itself based on the first input frame. Feeding data in a different format
or sample rate will fail.
PyAVResampler is not thread-safe.

The source PCMs must have the same sample rate, format, and number of channels.

Summary by CodeRabbit

New Features
- Introduced advanced audio resampling with stateful processing, buffer flushing, and customizable output frame sizing
- Enhanced audio data handling with improved timestamp preservation and codec format conversions
Tests
- Added comprehensive test coverage for audio resampling scenarios including upsampling, downsampling, channel conversion, and edge cases

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2025-12-20T12:59:26Z

Walkthrough

The changes introduce PyAVResampler, a new stateful audio resampling class that wraps PyAV's AudioResampler with internal frame buffering and PTS clock management. PcmData is extended with an empty property and bidirectional PyAV frame conversion methods (to_av_frame with PTS assignment, from_av_frame for deserializing PyAV AudioFrames). Resampler.repr is updated to include the runtime class name. Comprehensive tests are added for the new resampler across multiple scenarios.

Changes

Cohort / File(s)	Summary
Core Resampling Implementation `getstream/video/rtc/track_util.py`	Introduces PyAVResampler class with stateful resample(pcm, flush=False) and flush() methods. Adds empty property to PcmData. Extends PcmData.to_av_frame() to attach pts information and PcmData.from_av_frame() to convert PyAV AudioFrames to PcmData with time_base and format handling. Updates Resampler.repr to use runtime class name. Expands imports and type annotations (AsyncIterator, Dict, Optional, Union, Fraction). Adds MULAW_DECODE_TABLE import for G.711 support.
Test Coverage `tests/rtc/test_pcm_data.py`	Adds comprehensive TestPyAVResampler test suite covering upsampling, downsampling, frame_size determinism, channel conversions, format conversions, PTS/time_base handling, empty input handling, flush behavior, cross-chunk consistency, and error cases.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

PyAVResampler state management and buffering: Verify that frame buffering, PTS tracking, and flush logic correctly drain internal PyAV state without data loss or leaks; confirm reset behavior after flush.
PcmData.from_av_frame() conversion logic: Check handling of multiple frame formats, time_base conversion using Fraction, and defensive handling for empty/malformed frames.
PTS and time_base consistency: Ensure PTS values are correctly propagated through to_av_frame and from_av_frame, and that time_base conversions are numerically sound.
Test coverage: Verify tests exercise edge cases (empty inputs, format mismatches, boundary conditions) and that no duplicate test blocks introduce redundancy.

Possibly related PRs

Audio utils #170: Also modifies getstream/video/rtc/track_util.py and restructures PcmData/resampling APIs; directly related code-level changes to the same resampling subsystem.
More audio utils #173: Adds stateless NumPy-based resampler and PcmData.from_av_frame support; complementary resampling path using same PcmData conversion APIs.

Poem

🐰 With frames now buffered and PTS held tight,
I resample audio through the PyAV night,
Stateful and wise, no data shall flee,
From PCM to PyAV, resampled with glee! 🎵✨

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 73.33% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Added `PyAVResampler` - a stateful resampler implementation' accurately describes the main change: introducing a new PyAVResampler class with stateful resampling capabilities, which aligns with the primary additions across both the implementation and test files.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feature/stateful-resampler

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

dangusev · 2025-12-20T13:00:00Z

@coderabbitai review

coderabbitai · 2025-12-20T13:00:06Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (2)

getstream/video/rtc/track_util.py (2)
489-494: Inconsistent shape for empty frames.

Empty frames create different array shapes for mono vs. multi-channel:

Mono: np.array([], dtype=dtype) → shape (0,)

Multi-channel: np.zeros((channels, 0), dtype=dtype) → shape (channels, 0)

For consistency, mono should also return a 2D array when channels == 1:
🔎 Proposed fix for consistent empty frame shapes
         # Handle empty frames
         if frame.samples == 0:
-            if channels == 1:
-                samples_array = np.array([], dtype=dtype)
-            else:
-                samples_array = np.zeros((channels, 0), dtype=dtype)
+            samples_array = np.zeros((channels, 0), dtype=dtype)
         else:
Then flatten to 1D for mono later in the normalization logic (line 522-524) which already handles this case.
1755-1759: Consider clearer condition for empty frame check.

The condition if frame is not None and not frame.samples: works but could be more explicit. Consider:
🔎 More explicit empty frame check
     def _pyav_resample(self, frame: av.AudioFrame | None) -> list[av.AudioFrame]:
-        if frame is not None and not frame.samples:
+        if frame is not None and frame.samples == 0:
             # pyav resampler fails if audioframe has no samples
             return []
         return self._pyav_resampler.resample(frame)
Using == 0 makes it clearer that we're checking for zero samples, not a falsy samples object.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 39b27c2 and 472edaa.

📒 Files selected for processing (2)

getstream/video/rtc/track_util.py (4 hunks)
tests/rtc/test_pcm_data.py (2 hunks)

🧰 Additional context used

📓 Path-based instructions (3)

**/*test*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*test*.py: Use fixtures to inject objects in tests
Keep tests well organized and use test classes for similar tests
Tests that rely on file assets should use files from tests/assets/ folder; new files should be added there, existing ones reused if possible
Do not use mocks or mock things in tests unless explicitly asked to

Files:

tests/rtc/test_pcm_data.py

**/*.py

📄 CodeRabbit inference engine (AGENTS.md)

Plugins that work with audio, video, or WebRTC functionality should depend on getstream[webrtc] instead of just getstream to access WebRTC-related dependencies like aiortc, numpy, torch, torchaudio, soundfile, scipy, deepgram-sdk, and elevenlabs

Files:

tests/rtc/test_pcm_data.py
getstream/video/rtc/track_util.py

**/test_*.py

📄 CodeRabbit inference engine (AGENTS.md)

**/test_*.py: Use fixtures to inject objects in tests; test client fixtures can use the Stream API client
Load credentials from .env file; the client fixture will load credentials automatically
Keep tests well organized and use test classes to group similar tests
Tests that rely on file assets should use files from tests/assets/ folder; add new files there and reuse existing ones when possible. Do not use files larger than 256 kilobytes
Do not use mocks or mock objects in tests unless directly requested

Files:

tests/rtc/test_pcm_data.py

🧬 Code graph analysis (1)

getstream/video/rtc/track_util.py (2)

getstream/video/rtc/recording.py (1)

AudioFrame (64-70)

getstream/video/rtc/audio_track.py (1)

flush (150-159)

🔇 Additional comments (7)

getstream/video/rtc/track_util.py (5)

811-811: LGTM!

Assigning PTS to the AudioFrame enables proper timestamp tracking when converting PcmData to PyAV frames, which is essential for the stateful resampler's time management.

1669-1672: LGTM!

The empty property provides a clean, intuitive way to check if PcmData contains any samples. This is especially useful for the resampler's flush behavior.

1708-1738: LGTM!

The initialization logic properly validates format strings and handles both string and enum format types. The PyAV format determination and internal state setup are correct.

1796-1802: LGTM!

The PTS clock management is correct. The logic tracks the next sample timestamp in self._pts and calculates the result's starting PTS by subtracting the result length, ensuring continuous monotonic timestamps across multiple resample calls.

2032-2032: LGTM!

Using self.__class__.__name__ instead of a hardcoded class name is a best practice that ensures subclasses (like PyAVResampler) display the correct class name in their string representation.

tests/rtc/test_pcm_data.py (2)

1979-2326: Excellent test coverage for PyAVResampler!

The test suite comprehensively covers:

Upsampling and downsampling scenarios

Frame size buffering behavior

Channel conversions (mono ↔ stereo)

Format conversions (s16 ↔ f32)

Real-time streaming simulation with 20ms chunks

PTS/DTS timestamp tracking and monotonic progression

Edge cases (empty frames, flush behavior)

Cross-chunk consistency

The tests are well-organized in a dedicated class and follow the existing test patterns in the file.

2042-2082: Good defensive testing for input validation.

These tests verify that the stateful resampler correctly rejects inconsistent inputs (different sample rates or formats after initialization), which is critical for preventing subtle bugs in streaming scenarios. Using pytest.raises with the match parameter ensures the error messages are clear.

Added PyAVResampler - a stateful resampler implementation

472edaa

coderabbitai bot reviewed Dec 20, 2025

View reviewed changes

dangusev requested a review from tbarbugli December 22, 2025 12:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added `PyAVResampler` - a stateful resampler implementation #206

Added `PyAVResampler` - a stateful resampler implementation #206

dangusev commented Dec 20, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Dec 20, 2025 •

edited

Loading

Uh oh!

dangusev commented Dec 20, 2025

Uh oh!

coderabbitai bot commented Dec 20, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Added PyAVResampler - a stateful resampler implementation #206

Are you sure you want to change the base?

Added PyAVResampler - a stateful resampler implementation #206

Conversation

dangusev commented Dec 20, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Dec 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Pre-merge checks and finishing touches

Uh oh!

dangusev commented Dec 20, 2025

Uh oh!

coderabbitai bot commented Dec 20, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Added `PyAVResampler` - a stateful resampler implementation #206

Added `PyAVResampler` - a stateful resampler implementation #206

dangusev commented Dec 20, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Dec 20, 2025 •

edited

Loading