Description
First, I want to say thanks to @city96 for your GGUF loaders for CLIP and Model. They are a godsend.
I am running a 7900XTX with ROCm 7.1 on Windows, and I found no way to accelerate FP8: the weights auto-convert to BF16, which uses 2 bytes per parameter and lots of VRAM. And ROCm isn't good at VRAM management; AMD is improving, but slowly.
With your GGUF loader my 7900XTX can use its INT8 hardware acceleration, which finally gets the memory footprint down and stabilizes workflow execution. At least with Zimage, it works so much better using GGUF for both CLIP and Model.
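For a sense of the footprint difference, here is a quick back-of-the-envelope in Python (the 6B parameter count is just an illustrative placeholder, not any specific model): BF16 stores 2 bytes per parameter, while GGUF Q8_0 stores blocks of 32 INT8 weights plus one FP16 scale, about 1.06 bytes per parameter.

```python
# Rough weight-memory estimate; the parameter count is a placeholder.
params = 6e9  # hypothetical 6B-parameter diffusion model

bf16_bytes = params * 2        # BF16: 2 bytes per parameter
q8_0_bytes = params * 34 / 32  # GGUF Q8_0: 32 int8 weights + 1 fp16 scale per block

print(f"BF16: {bf16_bytes / 2**30:.1f} GiB")  # ~11.2 GiB
print(f"Q8_0: {q8_0_bytes / 2**30:.1f} GiB")  # ~5.9 GiB
```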
VAE Issues under ROCm
A persistent issue I have with ROCm acceleration is poor performance on VAE decode. I'm under the impression that VAE decode is almost instant under Nvidia CUDA, which may be why nobody seems to have looked into GGUF quantization for VAE models: as far as I can tell there are no VAE quants out there, only FP32, FP16, or BF16 models.
On AMD ROCm, VAE decode is a slow and expensive step requiring lots of extra VRAM and causing RAM spillage.
On ROCm 6.4 I found a workaround; on 7.1, Flux and Zimage Turbo VAE decode work, even though they spill into RAM even at a moderate resolution of 1024px.
The Qwen Image VAE for some reason requires much more memory: even at 1024px it fills the 24GB of VRAM and 64GB of RAM on my system, then fails with an OOM and a segmentation fault.
VAE GGUF Loader
Would it be possible to have a VAE GGUF loader that feeds the VAE Encode and VAE Decode nodes, along with GGUF quantization support for VAE models?
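To make the request concrete, here is an untested sketch of what such a node might look like. It assumes quantized VAE .gguf files with ComfyUI-compatible tensor names exist, and it leans on gguf-py's GGUFReader and quants.dequantize helpers. As written it dequantizes everything to FP32 at load time, so it would mainly save disk space and load bandwidth; keeping tensors quantized in VRAM would need on-the-fly dequantization like your existing Model/CLIP loaders do.

```python
# Untested sketch of a GGUF VAE loader node for ComfyUI.
# Assumptions: VAE .gguf files sit in the standard "vae" folder and use
# tensor names that comfy.sd.VAE already understands.
import torch
import gguf  # gguf-py, used here to read and dequantize the file

import comfy.sd
import folder_paths


class VAELoaderGGUF:
    @classmethod
    def INPUT_TYPES(cls):
        files = [f for f in folder_paths.get_filename_list("vae") if f.endswith(".gguf")]
        return {"required": {"vae_name": (files,)}}

    RETURN_TYPES = ("VAE",)
    FUNCTION = "load_vae"
    CATEGORY = "loaders"

    def load_vae(self, vae_name):
        path = folder_paths.get_full_path("vae", vae_name)
        reader = gguf.GGUFReader(path)
        sd = {}
        for t in reader.tensors:
            # Dequantize to float32 on the CPU; GGUF stores dims reversed
            # relative to torch, so flatten and reshape accordingly.
            data = gguf.quants.dequantize(t.data, t.tensor_type)
            shape = tuple(int(d) for d in reversed(t.shape))
            sd[t.name] = torch.from_numpy(data.copy()).reshape(shape)
        return (comfy.sd.VAE(sd=sd),)


NODE_CLASS_MAPPINGS = {"VAELoaderGGUF": VAELoaderGGUF}
NODE_DISPLAY_NAME_MAPPINGS = {"VAELoaderGGUF": "VAE Loader (GGUF)"}
```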