Hey Folks.
You're about to unlock OVI โ a super cool AI video tool available right now. This guide will transform you from zero to hero in minutes.
New video just dropped! Watch it first, then come back here to build your setup like a pro.
โก THE FAST LANE (Patreon Exclusive)
๐ 1-CLICK INSTALLER โ SKIP EVERYTHING BELOW
Why suffer through terminal commands when you don't have to?
๐ GET THE MAGIC BUTTON HERE
โจ No terminal. No headaches. No missing files.
Just pure, automated perfection.
โ Perfect for beginners or anyone who values their time
For the SAGE Attention Install guide look here: ๐๏ธ
https://www.patreon.com/posts/speed-up-comfyui-136348957
Want full control? Let's build this thing from scratch.
WhatWhy You Need ItLinkGitDownloads and manages code repositoriesDownload GitComfyUIYour AI workspace/command center
Your dashboard for managing all future extensions.
Bash
# Open CMD/Terminal in: ComfyUI\custom_nodes git clone https://github.com/Comfy-Org/ComfyUI-Manager.git
โ Pro Tip: This is your new best friend. Most other nodes can be installed through this manager with one click.
Install these custom nodes via ComfyUI Manager OR git clone:
๐ธ WanVideo Wrapper โ Core OVI functionality
https://github.com/kijai/ComfyUI-WanVideoWrapper
๐ธ VideoHelper Suite โ Video processing tools
https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite
๐ธ Essentials โ Quality-of-life improvements
https://github.com/cubiq/ComfyUI_essentials
๐ธ Use Everywhere โ Workflow organization
https://github.com/chrisgoringe/cg-use-everywhere
๐ธ KJNodes โ Advanced node collection
https://github.com/kijai/ComfyUI-KJNodes
๐ธ rgthree โ Power user tools
https://github.com/rgthree/rgthree-comfy
Now for the good stuff โ the actual AI models.
๐ฅ 12-16 GB VRAM (Most RTX 3080/3090/4070Ti/4080 users)
Use the FP8 scaled versions โ optimized for speed & efficiency
๐ Place in: models\diffusion_models
Video Model:
text
Audio Model:
text
๐ช 16+ GB VRAM (RTX 4090/5090/Professional Cards)
Use the BF16 versions โ maximum quality, no compromises
๐ Place in: models\diffusion_models
Video Model:
text
Audio Model:
text
๐ Full Model Library: Browse here
๐ Place in: models\vae
Audio VAE:
text
https://huggingface.co/Kijai/WanVideo_comfy/resolve/main/Ovi/mmaudio_vae_16k_fp32.safetensors
Audio BIG VAE:
text
โ ๏ธ Critical: You must download BOTH audio files or sound generation won't work!
This is what translates your creative descriptions into AI instructions.
๐ Place in: models\text_encoders
Download:
text
https://huggingface.co/Kijai/WanVideo_comfy/resolve/main/umt5-xxl-enc-bf16.safetensors
The final piece โ converts AI data into beautiful visuals.
๐ Place in: models\vae
Download:
text
https://huggingface.co/Kijai/WanVideo_comfy/resolve/main/Wan2_2_VAE_bf16.safetensors
Before you launch ComfyUI, verify:
Git installed
โฌ๏ธWorkflows downloadedโฌ๏ธ
ComfyUI downloaded & extracted
ComfyUI Manager installed
All 6 custom nodes installed
Video + Audio models (matched to your VRAM)
Both audio VAE files
Text encoder
Visual VAE
Everything is locked and loaded. Fire up ComfyUI and let's make some magic.