Discover how to retarget motion from a source video to a target character using the Wan2.1-Fun-Control model, a powerful tool for creating realistic character animations. Learn the workflow, key technologies, and core models involved in this innovative process.

Use Case: Video
Best For: Video
Models: Wan2.1
VRAM: Low VRAM (≤8GB)
Reading Time: 4 min

Required Models

Wan2.1

Setup Notes

Install the required models before opening the workflow template.
Recommended hardware: 16GB+ VRAM for the 14B model, with FP8 weights to reduce memory use (see the Hardware notes in section 6).

1. Workflow Overview

Purpose: Motion retargeting from a source video to a target character using the Wan2.1-Fun-Control model.
Key Tech:
- Pose Extraction: DWPreprocessor detects pose keypoints in the input video.
- Multimodal Control: CLIP vision + T5 text + depth maps (DepthAnythingPreprocessor).
- Temporal Coherence: WanFunControlToVideo generates frame-consistent videos.

2. Core Models

Model Name	Function
Wan2.1-Fun-Control-14B	Base motion control model (14B params, FP8 optimized).
umt5-xxl_fp8_e4m3fn_scaled	Text encoder for prompts (e.g., negative prompts to filter bad frames).
depth_anything_vitl14	Depth preprocessor for spatial consistency.

3. Key Nodes

3.1 Input Processing

VHS_LoadVideo:
- Loads the input video (e.g., 5月12日 0.8.mp4) and extracts its frames (25 FPS by default).
LoadImage:
- Loads the target character image (e.g., 00088-3677135724.png).

3.2 Motion Analysis

DWPreprocessor:
- Extracts pose keypoints (using yolox_l.onnx and dw-ll_ucoco_384).
DepthAnythingPreprocessor:
- Generates depth maps for background alignment.

3.3 Video Generation

WanFunControlToVideo:
- Key params: 832x480 output, 81 frames (~3.24s), CFG=1.0.
- Inputs: Pose keypoints + CLIP features + text conditioning.
KSampler:
- Settings: 20 steps, Euler sampler, fixed seed (198).
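
The frame count and duration quoted above follow from simple arithmetic: 81 frames at 25 FPS is 81 / 25 = 3.24 s. The Python sketch below computes that duration and rounds an arbitrary frame count down to the form 4n + 1. The 4n + 1 rule is an assumption, based on the 4x temporal compression commonly described for the Wan2.1 VAE; this guide does not state it. Both function names are hypothetical helpers, not ComfyUI APIs.

```python
def clip_duration_seconds(num_frames: int, fps: float) -> float:
    """Playback duration of a clip with `num_frames` frames at `fps`."""
    if num_frames <= 0 or fps <= 0:
        raise ValueError("num_frames and fps must be positive")
    return num_frames / fps


def nearest_valid_length(num_frames: int) -> int:
    """Round down to the nearest frame count of the form 4n + 1.

    Assumption: the model expects lengths such as 1, 5, ..., 77, 81, 85.
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    return ((num_frames - 1) // 4) * 4 + 1


if __name__ == "__main__":
    print(clip_duration_seconds(81, 25))   # duration of the default clip
    print(nearest_valid_length(100))       # largest 4n + 1 not above 100
```

If a longer clip is needed, choosing the length with a helper like this keeps the requested frame count on the assumed 4n + 1 grid instead of relying on the node to adjust it.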

3.4 Post-Processing

SkipLayerGuidanceWanVideo:
- Skips layers 9 and 10 of the diffusion model at 0.2 strength to balance detail against motion fluency.
WanVideoEnhanceAVideoKJ:
- Reduces flickering (strength=0.2).

4. Workflow Structure

Stage	Key Nodes	Function
Input Prep	VHS_LoadVideo + LoadImage	Loads video and target image.
Motion Extract	DWPreprocessor → DepthAnything	Extracts poses and depth maps.
Conditioning	CLIPTextEncode + CLIPVisionEncode	Encodes text/visual conditions.
Video Gen	WanFunControlToVideo → KSampler	Renders motion-retargeted frames.
Output Export	VHS_VideoCombine	Final video (H.264, CRF=15).
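
For readers who export frames and encode them outside ComfyUI, the export settings in the last row (H.264, CRF=15, 25 FPS) can be approximated with a plain ffmpeg call. The sketch below only builds the argument list; it makes no claim about how VHS_VideoCombine invokes its encoder internally. The function name, the frame pattern, and the output path are hypothetical, and the `yuv420p` pixel format is an added assumption for broad player compatibility.

```python
from typing import List


def build_ffmpeg_args(
    frame_pattern: str,
    output_path: str,
    fps: int = 25,
    crf: int = 15,
) -> List[str]:
    """Build an ffmpeg command that encodes numbered frames to H.264."""
    return [
        "ffmpeg",
        "-y",                       # overwrite the output file if it exists
        "-framerate", str(fps),     # input frame rate of the image sequence
        "-i", frame_pattern,        # e.g. frames/%05d.png (hypothetical path)
        "-c:v", "libx264",          # H.264 encoder
        "-crf", str(crf),           # lower CRF means higher quality
        "-pix_fmt", "yuv420p",      # assumption: widest player support
        output_path,
    ]


if __name__ == "__main__":
    print(" ".join(build_ffmpeg_args("frames/%05d.png", "retargeted.mp4")))
```

The list can be passed to `subprocess.run(args, check=True)` on a machine where ffmpeg is installed.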

5. Inputs & Outputs

Inputs:
- Source video (MP4, 25FPS recommended).
- Target character image (PNG/JPG, transparent background preferred).
- Optional text prompts (style control).
Output:
- Motion-retargeted video (default 832x480, 25FPS).

6. Notes

Hardware:
- 16GB+ VRAM (RTX 4080+ recommended for 14B model).
- Enable FP8 optimization (fp8_e4m3fn) for lower VRAM usage.
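
A rough estimate shows why FP8 matters for a 14B-parameter model. The Python sketch below counts weight storage only, at one byte per parameter for FP8 and two for FP16. It ignores the text encoder, the VAE, activations, and any offloading ComfyUI may perform, so treat the result as a lower bound rather than a measured figure. The function name and the dtype table are illustrative assumptions.

```python
# Bytes needed to store one parameter in each format.
BYTES_PER_PARAM = {
    "fp32": 4,
    "fp16": 2,
    "bf16": 2,
    "fp8_e4m3fn": 1,
}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Approximate size of the model weights alone, in GiB."""
    if dtype not in BYTES_PER_PARAM:
        raise ValueError(f"unknown dtype: {dtype}")
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    for dtype in ("fp16", "fp8_e4m3fn"):
        print(f"{dtype}: {weight_memory_gib(14e9, dtype):.1f} GiB")
```

Under these assumptions the FP8 weights come to roughly 13 GiB and the FP16 weights to roughly 26 GiB, which is consistent with the 16GB+ recommendation once FP8 is enabled.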
Dependencies:
- Download Wan2.1-Fun-Control-14B and depth_anything_vitl14.pth manually.
Troubleshooting:
- Reduce flickering: Increase KSampler steps (20→30) or lower SkipLayerGuidance strength (0.2→0.1).
- Resolution errors: Make the source video and the target image share the same aspect ratio (e.g., both 512x512).
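
One way to act on the aspect-ratio advice is to center-crop each input to the output aspect ratio before loading it. The Python sketch below computes the crop box for a given source size, defaulting to the 832x480 output used in this workflow. The function name is hypothetical, and center-cropping is only one possible strategy (padding is another); the workflow itself does not prescribe either.

```python
from typing import Tuple


def center_crop_box(
    src_w: int,
    src_h: int,
    target_w: int = 832,
    target_h: int = 480,
) -> Tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of the largest centered crop
    whose aspect ratio matches target_w:target_h."""
    if min(src_w, src_h, target_w, target_h) <= 0:
        raise ValueError("all dimensions must be positive")
    # Compare aspect ratios with integers to avoid float edge cases.
    if src_w * target_h > src_h * target_w:
        # Source is wider than the target: trim the sides.
        new_w = round(src_h * target_w / target_h)
        left = (src_w - new_w) // 2
        return (left, 0, left + new_w, src_h)
    # Source is taller than (or equal to) the target: trim top and bottom.
    new_h = round(src_w * target_h / target_w)
    top = (src_h - new_h) // 2
    return (0, top, src_w, top + new_h)


if __name__ == "__main__":
    print(center_crop_box(1920, 1080))
    print(center_crop_box(512, 512))
```

The returned 4-tuple uses the same (left, upper, right, lower) order that Pillow's `Image.crop` accepts, so it can be applied to the target image directly and to each video frame before resizing.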

Unlocking Realistic Motion Retargeting: A Deep Dive into Wan2.1-Fun-Control