Unlock Anime-Style Video Magic: A Step-by-Step WAN2.1 Workflow Guide
Generate Anime-Style Videos with WAN2.1 Model: Learn how to convert input videos to anime style with dynamic prompts, HunyuanLoom technology, and outputs 16fps MP4 videos. Try this workflow now!
Workflow Overview
Generate Anime-Style Videos with WAN2.1 Model: Learn how to convert input videos to anime style with dynamic prompts, HunyuanLoom technology, and outputs 16fps MP4 videos. Try this workflow now!
Content type: Workflow
Primary intent: Download
Required Models
- Wan2.1
Setup Notes
- Install the required models before opening the workflow template.
- Recommended hardware: Low VRAM (≤8GB).
- Use the download button above to import the workflow JSON into ComfyUI.
1. Workflow Overview

This is an anime-style video generation workflow based on WAN2.1 model (DaKai optimized), featuring:
Convert input video (e.g.,
dance.mp4) to anime styleDynamic prompts for character details (19yo Chinese schoolgirl dancing)
HunyuanLoom technology for motion coherence
Outputs 16fps MP4 video (H.264 encoded)
2. Core Models
Model Name | Description |
|---|---|
wan2.1_t2v_1.3B_fp16 | Main video generation model (1.3B params) |
umt5_xxl_fp16 | Multilingual CLIP text encoder |
wan_2.1_1.3b_vae | Lightweight VAE for color accuracy |
3. Key Components
Special Nodes:
VHS_LoadVideo: Frame extraction (skips 120 frames, keeps 81)
HYFlowEditGuiderCFG: Dynamic CFG guidance (CFG=7.5)
SamplerCustomAdvanced: Advanced sampler (16 steps, simple scheduler)
VAEDecodeTiled: Tiled decoding (512x512 tiles, 64px overlap)
Installation:
Video Helper Suite: Install via ComfyUI Manager
HunyuanLoom: Manual install from GitHub
WAN2.1 Models: Download separately to
modelsfolder
4. Workflow Structure
Group 1: Load Models
Load UNET/CLIP/VAE (
wan2.1_t2v_1.3B_fp16+umt5_xxl_fp16)Apply
ApplyTeaCachePatch(strength=0.1) for acceleration
Group 2: Prompts
Positive Prompt: Detailed character/scene description (amber eyes, JK uniform, beach sunset)
Negative Prompt: 60+ filters for realism/low-quality
FluxGuidanceboosts prompt weights
Group 3: Video Input
Resize input to 832x480 (
nearest-exact)Tiled VAE encoding
Group 4: Sampling & Output
Generate latent with
HYFlowEditSampler(seed=123478)Tiled decode + video render (CRF=19, yuv420p)
5. Inputs & Outputs
Inputs:
Video file (default:
dance.mp4)Pre-configured prompts
Frame rate (16fps)
Outputs:
MP4 video (e.g.,
hyloom_00003.mp4)Optional intermediate frames
6. Notes
VRAM: 16GB+ recommended (1.3B model is VRAM-heavy)
Video Specs: Input ≥1280x720, ≤30sec duration
Motion Tuning: Adjust
motion_coherenceinHYFlowEditGuiderCFGif flickering occursCommon Error: Missing
umt5_xxl_fp16breaks CLIP encoding