Skip to content

feat: support Qwen2D VAE#1714

Open
stduhpf wants to merge 1 commit into
leejet:masterfrom
stduhpf:qwen2d-vae
Open

feat: support Qwen2D VAE#1714
stduhpf wants to merge 1 commit into
leejet:masterfrom
stduhpf:qwen2d-vae

Conversation

@stduhpf

@stduhpf stduhpf commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Summary

Wan2.1 VAE, but 3D convolutions are pruned into 2D convs for decoding single frames.

Download Link: https://huggingface.co/Anzhc/Qwen2D-VAE

(tensor naming convention in this file is not the same as the one currently supported in stable-diffusion.cpp, fixed by #1713)

Decoding only for now, I ran out of time.

Related Issue / Discussion

#1610

Additional Information

Qwen2D-VAE:
[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 4563.99 MB(VRAM)
[DEBUG] model_manager.cpp:727 - model manager releasing compute params ( 48.28 MB, 104 tensors) from ROCm0
[DEBUG] vae.hpp:215 - computing vae decode graph completed, taking 2.02s

qwen_image_vae:
[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 7420.44 MB(VRAM)
[DEBUG] model_manager.cpp:727 - model manager releasing compute params (136.46 MB, 104 tensors) from ROCm0
[DEBUG] vae.hpp:215 - computing vae decode graph completed, taking 5.84s

Checklist

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant