Anime Video Magic: Unleashing the Potential of WAN2.1 for Stunning Visuals
Transform 2D anime characters into dynamic videos with Alibaba's WAN2.1 model. Learn how to use this workflow to generate stunning videos with camera motion, and get started with your own Image-to-Video creations today!
Required Models
- Wan2.1
Required Nodes
- Upscaler
Setup Notes
- Install the required models before opening the workflow template.
- Recommended hardware: Low VRAM (≤8GB).
1. Workflow Overview

This workflow uses Alibaba’s WAN2.1 model for Image-to-Video (I2V) generation, transforming a 2D anime-style character image (e.g., a girl cycling) into a dynamic video with camera motion.
Core Models:
Wan2.1-I2V-14B: Video generation model (480P output).
UMT5-XXL: Multilingual text encoder for Chinese prompts.
OpenCLIP-ViT-H: Image encoder for input analysis.
2. Key Nodes
| Node | Function | Installation |
|---|---|---|
| | Loads WAN2.1 model (requires | Manual download to |
| | Processes bilingual prompts (e.g., "cycling scene") | Requires |
| RIFE VFI | Frame interpolation for smoother video | Install via ComfyUI Manager |
| VHS_VideoCombine | Combines frames into MP4 | Install |
Dependencies:
Manual model downloads:
Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors
umt5-xxl-enc-bf16.safetensors
Save to: ComfyUI/models/wan_video/
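Before opening the workflow, it can help to confirm that both manually downloaded files are where the loader expects them. The following is a minimal sketch, not part of the workflow itself: the file names and the wan_video folder come from the list above, while the ComfyUI root path passed in at the bottom is an assumption about your install location.

```python
from pathlib import Path

# File names as listed under "Manual model downloads".
REQUIRED_MODELS = [
    "Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors",
    "umt5-xxl-enc-bf16.safetensors",
]


def find_missing_models(comfyui_root, names=REQUIRED_MODELS):
    """Return the required files that are not present in models/wan_video/."""
    model_dir = Path(comfyui_root) / "models" / "wan_video"
    return [name for name in names if not (model_dir / name).is_file()]


if __name__ == "__main__":
    # "ComfyUI" is an assumed install location; adjust it to your setup.
    missing = find_missing_models("ComfyUI")
    if missing:
        print("Missing model files:", ", ".join(missing))
    else:
        print("All required model files found.")
```

An empty result means both files are in place. Any name that is returned points to a file that is missing or saved under a different folder.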
3. Workflow Groups
Load Image: Input character image (e.g., 00028-2257557710.png).
Model Load: Load WAN2.1, VAE, and text encoders.
Prompt Processing: Positive/Negative prompts (e.g., "speed effect").
Video Generation: WanVideoSampler generates latent frames.
Post-Processing: Upscaling (CR Upscale Image).
Video Export: MP4 output (VHS_VideoCombine).
Frame Interpolation: Boosts FPS to 32 via RIFE VFI.
4. Inputs & Outputs
Inputs:
Image: 480x768 PNG (anime character).
Prompts: Chinese/English descriptions.
Params: Frame rate 16 FPS (interpolated to 32 FPS), CFG=8.0, Steps=30.
Output:
MP4 video (default: xiao.mp4).
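The 16→32 FPS step changes the number of frames, not the length of the clip. The sketch below works through that arithmetic under two assumptions that are not stated in the workflow: the interpolator inserts new frames only between adjacent source frames, and the clip has 81 source frames (a hypothetical value chosen for illustration).

```python
def interpolated_frame_count(source_frames: int, multiplier: int = 2) -> int:
    """Frame count after interpolation.

    Assumes (multiplier - 1) new frames are inserted between each adjacent
    pair of source frames, so the first and last frames are kept as-is.
    """
    if source_frames < 1 or multiplier < 1:
        raise ValueError("source_frames and multiplier must be >= 1")
    return (source_frames - 1) * multiplier + 1


def clip_seconds(frames: int, fps: float) -> float:
    """Playback duration of a clip in seconds."""
    return frames / fps


if __name__ == "__main__":
    source_frames = 81  # hypothetical clip length, not taken from the workflow
    output_frames = interpolated_frame_count(source_frames, multiplier=2)

    print("source:", source_frames, "frames,", clip_seconds(source_frames, 16), "s at 16 FPS")
    print("output:", output_frames, "frames,", clip_seconds(output_frames, 32), "s at 32 FPS")
```

Under these assumptions the duration stays at roughly five seconds while the frame count about doubles, which is why the interpolated video looks smoother rather than longer.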
5. Notes
VRAM: ≥16GB GPU recommended (use bf16 for lower usage).
Debugging:
Missing model → Check wan_video folder path.
RIFE error → Ensure rife47.pth is in ComfyUI/models/rife/.
Optimization:
Reduce resolution (e.g., 360P) for lower VRAM.
Disable experimental features to speed up.
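When reducing resolution, it is usually safest to keep the input's aspect ratio so the character is not stretched. The helper below is a rough sketch of that calculation. The function name is hypothetical, and snapping to a multiple of 8 is an assumption about what the model accepts, not something the workflow specifies.

```python
def scale_resolution(width, height, target_short_side, multiple=8):
    """Scale (width, height) so the shorter side is about target_short_side.

    Keeps the aspect ratio and snaps both sides to the nearest `multiple`
    (assumed here to be 8; check what your model loader actually requires).
    """
    scale = target_short_side / min(width, height)

    def snap(value):
        return max(multiple, round(value * scale / multiple) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    # 480x768 is the input size listed under "Inputs"; 360 is the 360P target.
    print(scale_resolution(480, 768, 360))
```

For the 480x768 input this gives 360x576, which has 56.25% of the original pixel count.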