What models does this workflow require?

Transform videos into stylized animations with Wan2.1 VACE, Pose Control, and Depth Control. Discover how to leverage AI models for stunning visual effects and learn how to use this workflow to elevate your video editing skills.

Use Case: Video
Best For: Video
Models: Flux
Wan2.1
VRAM: Medium VRAM (12–16GB)
Reading Time: 4 min

View Required Models More Video Workflows

Workflow Overview

Content type: Workflow

Primary intent: Download

Required Models

Flux
Wan2.1

Setup Notes

Install the required models before opening the workflow template.
Recommended hardware: Medium VRAM (12–16GB).

1. Workflow Overview

m9wlozlo7in3t7kocz23a3ea37ed55437a7436110c0b4c4e1fa8a6121ea0f62e25f6d6cb5f43b5f7fe.gif

Purpose:
This workflow transforms input videos into stylized animations using Wan2.1 VACE with:
- Pose Control (OpenPose) and Depth Control (Depth Map)
- Frame interpolation (FILM VFI) and video upscaling
- Auto-prompt generation via Florence2
Core Models:
- Wan2.1 VACE: Main video generation model for style transfer
- Florence2: Image captioning model for auto-prompts
- DepthAnything V2: Depth map generator for structural control
- FILM VFI: Frame interpolation model (16FPS → 32FPS)

2. Key Nodes

Node	Function	Installation	Dependencies
`WanVideoModelLoader`	Loads Wan2.1 model	`ComfyUI-WanVideoWrapper`	Download models: HuggingFace
`DepthAnything_V2`	Generates depth maps	`ComfyUI-DepthAnythingV2`	Requires `depth_anything_v2_vitl_fp16.safetensors`
`Florence2Run`	Auto-generates prompts	`ComfyUI-Florence2`	Load `Florence-2-Flux-Large` model
`FILM VFI`	Frame interpolation	Built-in	Download `film_net_fp32.pt`
`VHS_VideoCombine`	Video rendering/export	`ComfyUI-VideoHelperSuite`	Requires FFmpeg

3. Workflow Structure

Group 1: Input Setup

Inputs: Video file, reference image, seed, resolution cap (e.g., 1280x720)
Outputs: Preprocessed frames

Group 2: Control Generation

Pose Control: OpenPose keypoints via DWPreprocessor
Depth Control: Depth maps via DepthAnything_V2
Prompts: Manual input or auto-generated by Florence2

Group 3: Video Generation

Wan2.1 Model: Generates latent video frames
VACE Encoding: Encodes frames for model processing

Group 4: Post-Processing

Frame Interpolation: Upsamples to 32FPS with FILM VFI
Video Export: Combines frames into MP4

4. Inputs & Outputs

Required Inputs:
- Video file (MP4)
- Reference image (e.g., Girl_85_Highres.png)
- Positive prompt (e.g., "Night scene, a dancing girl")
- Resolution cap (default: 1280)
Output:
- Final video (saved to output/Video)
- Intermediate results (depth maps, pose keypoints)

5. Notes

Hardware:
- ≥12GB VRAM (use BlockSwap for lower VRAM)
- Enable Triton/SageAttn for 20%-50% speed boost
Troubleshooting:
- Download missing models via ComfyUI Manager
- Depth control is more stable than pose control
Optimization:
- Adjust blocks_to_swap (30-40) in WanVideoBlockSwap

FAQ

Related Workflows

Related by Use Case

Transform Your Videos into Stylized Animations with Advanced AI Technology

Unlock the power of video stylization with our workflow! Transform input videos into stunning animations using Wan2.1 model, AnimeLineArt, and DepthAnything. Discover how to harness ControlNet, T5 text encoding, and frame interpolation for dynamic content. Learn more and get started now!

Create Stunning Animated Videos with Ease: A Flux.1 and WanVideo Tutorial

Generate stunning images and videos with Flux.1 and WanVideo plugins. Learn how to integrate these models for high-quality image and video creation. Get started now!

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

Unlock AI-powered video translation with Wan2.1 VACE Model! Discover a workflow that enhances each frame, controls depth, and optimizes generation. Learn how to leverage this innovative technology and transform your video content today!

Unlock Anime-Style Video Magic: A Step-by-Step WAN2.1 Workflow Guide

Generate Anime-Style Videos with WAN2.1 Model: Learn how to convert input videos to anime style with dynamic prompts, HunyuanLoom technology, and outputs 16fps MP4 videos. Try this workflow now!

Related by Model

Transform Your Videos into Stylized Animations with Advanced AI Technology

Create Stunning Animated Videos with Ease: A Flux.1 and WanVideo Tutorial

Generate stunning images and videos with Flux.1 and WanVideo plugins. Learn how to integrate these models for high-quality image and video creation. Get started now!

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

Unlock Anime-Style Video Magic: A Step-by-Step WAN2.1 Workflow Guide

Generate Anime-Style Videos with WAN2.1 Model: Learn how to convert input videos to anime style with dynamic prompts, HunyuanLoom technology, and outputs 16fps MP4 videos. Try this workflow now!

Low VRAM Alternatives

Create Stunning Animated Videos with Ease: A Flux.1 and WanVideo Tutorial

Generate stunning images and videos with Flux.1 and WanVideo plugins. Learn how to integrate these models for high-quality image and video creation. Get started now!

Transform Your Videos into Stylized Animations with Advanced AI Technology

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

Revive Your Videos: AI-Driven Frame-Level Restoration and Enhancement

Unlock AI-powered video restoration! Discover how to repair blurry videos with frame-level enhancement and style migration using cutting-edge models like Wan2_1-T2V-1_3B_bf16. Learn how to install and utilize these models for stunning video re-rendering and high-definition restoration.

Looking for more Video workflows? Browse the Video hub for additional templates and guides.

Unlock Seamless Product Background Blending with This AI-Powered Workflow

Unlock Animated Wing Effects with WAN2.1: A Step-by-Step Workflow

Summary

Chapter

workflow:

CustomNodes:

Note WanVideoTorchCompileSetti...