Skip to content

Add unified MLX model format and audio-video generation improvements#18

Open
james-see wants to merge 5 commits intoBlaizzy:mainfrom
james-see:main
Open

Add unified MLX model format and audio-video generation improvements#18
james-see wants to merge 5 commits intoBlaizzy:mainfrom
james-see:main

Conversation

@james-see
Copy link

  • Add unified MLX model conversion that creates single model.safetensors
  • Support loading from unified MLX format in generate_av.py
  • Fix conversion bug that included quantized weights (fp4/fp8 scale keys)
  • Add local path support in get_model_path (tilde expansion)
  • Update README with MLX model conversion and usage instructions
  • Add soundfile dependency for audio processing
  • Add CLI entry points for generate_av and convert commands

The unified format provides:

  • Faster loading (single file vs multiple)
  • Pre-sanitized weights (no on-the-fly transformation)
  • Easy sharing via HuggingFace

James Campbell added 5 commits January 29, 2026 03:51
- Add unified MLX model conversion that creates single model.safetensors
- Support loading from unified MLX format in generate_av.py
- Fix conversion bug that included quantized weights (fp4/fp8 scale keys)
- Add local path support in get_model_path (tilde expansion)
- Update README with MLX model conversion and usage instructions
- Add soundfile dependency for audio processing
- Add CLI entry points for generate_av and convert commands

The unified format provides:
- Faster loading (single file vs multiple)
- Pre-sanitized weights (no on-the-fly transformation)
- Easy sharing via HuggingFace
- Add GitHub Actions workflow to publish to PyPI on tagged releases
- Export generate_video and generate_video_with_audio functions
- Bump version to 0.1.0 for first PyPI release
- Rename package from mlx-video to mlx-video-with-audio
- Update project URLs to correct repository
- Bump version to 0.1.1
- Update description to reflect video+audio generation capabilities
- Add keywords for better PyPI discoverability
- Update README with PyPI installation instructions
- Update repository URLs and image paths
- Bump version to 0.1.2
- Add stage parameter to denoise() and denoise_av() functions
- Emit STAGE:N:STEP:X:Y:Denoising to stderr for progress tracking
- Enables external apps to parse generation progress
- Bump version to 0.1.3
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant