How much VRAM is recommended?

Unlock dynamic video generation with the Wan2.1-I2V-14B model! Learn how to create stunning "Avatar Summoning" effects with text prompts, input images, and custom LoRAs. Discover the workflow, core models, and key nodes to get started

Use Case: Video
Best For: Video
Models: Wan2.1
VRAM: Low VRAM (≤8GB)
Reading Time: 3 min

View Required Models More Video Workflows

Workflow Overview

Content type: Workflow

Primary intent: Download

Required Models

Wan2.1

Setup Notes

Install the required models before opening the workflow template.
Recommended hardware: Low VRAM (≤8GB).

1. Workflow Overview

ma2c2n22h9uer7oa44p2917e9a1ef66f5047bad22ad6b534bbf1c8b30d3643181f48bfbd47e676683ac.gif

This workflow leverages the Wan2.1-I2V-14B model to generate dynamic videos with "Avatar Summoning" effects (e.g., semi-transparent phantom synchronized with character movements). It combines text prompts + input images and custom LoRAs (e.g., spell effects).

2. Core Models

Wan2.1-I2V-14B-480P_fp8_e4m3fn.safetensors
- Main model for video generation (image-to-video). Requires BF16 precision.
umt5-xxl-enc-bf16.safetensors
- T5 text encoder for processing complex prompts (supports Chinese).
Wan2.1_VAE_bf16.safetensors
- Decodes latent frames to images.

3. Key Nodes

WanVideoModelLoader
- Loads the main model. Manual download required (place in ComfyUI/models/wan_video).
WanVideoTextEncode
- Processes text prompts (positive/negative) using T5.
WanVideoSampler
- Uses DPM++ SDE sampler (25 steps default).
WanVideoLoraSelect
- Applies custom LoRAs (e.g., Avatar Summoning_beta).
VHS_VideoCombine
- Renders frames into MP4 (16 FPS).

4. Workflow Structure

Input Group
- Text prompts (e.g., "A woman swings a sword, summoning a purple phantom").
- Reference image (e.g., "修仙女子.png").
Generation Group
- Model initialization via WanVideoModelLoader and WanVideoVAELoader.
- Frame generation via WanVideoSampler.
Output Group
- Video synthesis with VHS_VideoCombine (480x832 resolution).

5. Inputs & Outputs

Inputs: Text prompts, image, seed (e.g., 1057359483639287).
Outputs: MP4 video (H.264, with metadata).

6. Notes

Dependencies: Manually download Wan2.1 models and LoRAs.
VRAM: 16GB+ GPU recommended. Use BF16 to reduce usage.
Compatibility: Requires ComfyUI-WanVideoWrapper (install via ComfyUI Manager).
Troubleshooting:
- FileNotFoundError if models are missing.
- Reduce resolution in WanVideoBlockSwap for CUDA OOM errors.

FAQ

Related Workflows

Related by Use Case

Unlock Anime-Style Video Magic: A Step-by-Step WAN2.1 Workflow Guide

Generate Anime-Style Videos with WAN2.1 Model: Learn how to convert input videos to anime style with dynamic prompts, HunyuanLoom technology, and outputs 16fps MP4 videos. Try this workflow now!

Transform Your Videos into Stylized Animations with Advanced AI Technology

Unlock the power of video stylization with our workflow! Transform input videos into stunning animations using Wan2.1 model, AnimeLineArt, and DepthAnything. Discover how to harness ControlNet, T5 text encoding, and frame interpolation for dynamic content. Learn more and get started now!

Wan2.7 Is Now Available in ComfyUI via Partner Nodes

Wan2.7 is a comprehensive upgrade over 2.6 — better quality, audio, dynamics, and a full suite of creative workflows now available in ComfyUI via Partner Nodes

Create Stunning Animated Videos with Ease: A Flux.1 and WanVideo Tutorial

Generate stunning images and videos with Flux.1 and WanVideo plugins. Learn how to integrate these models for high-quality image and video creation. Get started now!

Related by Model

Unlock Anime-Style Video Magic: A Step-by-Step WAN2.1 Workflow Guide

Generate Anime-Style Videos with WAN2.1 Model: Learn how to convert input videos to anime style with dynamic prompts, HunyuanLoom technology, and outputs 16fps MP4 videos. Try this workflow now!

Transform Your Videos into Stylized Animations with Advanced AI Technology

From Images to Videos: A Deep Dive into the Wan2.1-I2V Workflow

Unlock AI-powered video generation with Alibaba's Wan2.1 model! Learn how to create stunning videos from static images using this workflow guide.

Unlock the Power of Text-to-Video Generation with Aliyun's Wan2.1 Model

Generate dynamic videos with text prompts using Aliyun's Wan2.1 model! Learn how to utilize this Text-to-Video workflow with Chinese support, customizable frame rates, and resolutions. Discover the core models, key nodes, and workflow structure.

Looking for more Video workflows? Browse the Video hub for additional templates and guides.

Unlock Advanced Lighting Optimization: A Step-by-Step Workflow for Stunning Images

From Brushstrokes to Pixels: A Deep Dive into Stable Diffusion's Graffiti Capabilities

Summary

Chapter

workflow:

CustomNodes:

WanVideoDecode WanVideoModelLo...

workflow

Unlock the Power of Wan2.1: Dynamic Video Generation with Avatar Summoning Effects