What models does this workflow require?

Unlock AI-powered video character redrawing with Wan2.1Fun! Discover how this workflow leverages Stable Diffusion, GroundingDino, and Openpose to transform characters into stylized images and videos. Learn more and elevate your video editing skills!

Use Case: Video
Best For: Video
Models: Wan2.1
Controlnet
Sd
Key Nodes: Controlnet
VRAM: Medium VRAM (12–16GB)
Reading Time: 4 min

View Required Models More Video Workflows

Workflow Overview

Content type: Workflow

Primary intent: Download

Required Models

Wan2.1
Controlnet
Sd

Required Nodes

Controlnet

Setup Notes

Install the required models before opening the workflow template.
Recommended hardware: Medium VRAM (12–16GB).

1. Workflow Overview

m9bcut8pbdpiuechjkw6acf8a263f6107cfe1be787f47dac5a192127eb2c95e5b29502af4c8bfad8e83.png

This workflow, named “wan2.1Fun_Video Character Redraw”, converts characters in a video into stylized images or videos using AI models. Key technologies include:

Frame Extraction: Extracts key frames from input video.
Segmentation & Pose Detection: Uses GroundingDino+SAM for person segmentation and Openpose for pose keypoints.
Text/Image-Guided Generation: Generates new content via Stable Diffusion (Wan2.1-Fun-Control).
Video Synthesis: Combines frames into a final video.

2. Core Models

Stable Diffusion (Wan2.1-Fun-Control-14B)
- Purpose: Generates high-quality images/videos from text/image prompts.
- Model File: Wan2.1-Fun-Control-14B_fp8_e4m3fn.safetensors.
GroundingDino + SAM
- Purpose: Detects and segments characters (e.g., man label).
- Model Files: GroundingDINO_SwinT_OGC, sam_vit_b_01ec64.pth.
ControlNet (Openpose)
- Purpose: Preserves original pose structure.
- Model File: control_v11p_sd15_openpose.pth.
Florence2
- Purpose: Auto-generates image captions (prompt inversion).
- Model File: Florence-2-large.

3. Key Nodes

Video Input:
- VHS_LoadVideo: Loads video files (e.g., 2795746-uhd_2160_3840_25fps.mp4).
Character Processing:
- GroundingDinoSAMSegment: Segments characters and generates masks.
- OpenposePreprocessor: Extracts pose keypoints.
Generation Control:
- WanVideoTextEncode: Processes text prompts (e.g., "futuristic robot").
- WanVideoSampler: Controls sampling (steps=25, CFG=8).
Output Synthesis:
- VHS_VideoCombine: Combines frames into MP4 (H.264).

4. Workflow Structure (Groups)

Frame Redraw (Text-Based)
- Input: Video + text prompts.
- Output: Redrawn first frame.
Wan2.1 Character Conversion
- Input: Masks + pose data.
- Output: Stylized video.
Prompt Inversion (Florence2)
- Input: Reference image.
- Output: Auto-generated detailed caption.

5. Inputs & Outputs

Inputs:
- Video file (MP4).
- Optional text prompts.
- Generation params (512x910, Euler sampler).
Output:
- Generated video (e.g., AnimateDiff_00027.mp4).

6. Notes

Dependencies:
- Install via ComfyUI Manager:
  - ComfyUI-WanVideoWrapper (video generation).
  - comfyui_controlnet_aux (pose extraction).
  - comfyui-florence2 (prompt inversion).
Hardware:
- Recommended VRAM ≥12GB (Wan2.1 model is large).
Troubleshooting:
- Model path errors: Verify .safetensors file locations.
- Video encoding issues: Adjust CRF in VHS_VideoCombine (default=19).

FAQ

Related Workflows

Related by Use Case

Transform Your Videos into Stylized Animations with Advanced AI Technology

Unlock the power of video stylization with our workflow! Transform input videos into stunning animations using Wan2.1 model, AnimeLineArt, and DepthAnything. Discover how to harness ControlNet, T5 text encoding, and frame interpolation for dynamic content. Learn more and get started now!

Unlock Advanced Video Depth Control with Wan Model-Based Workflow

Unlock AI-powered video depth control with our Wan model-based workflow. Discover how to extract depth maps, stylize videos with text guidance, and more. Dive into the details now!

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

Unlock AI-powered video translation with Wan2.1 VACE Model! Discover a workflow that enhances each frame, controls depth, and optimizes generation. Learn how to leverage this innovative technology and transform your video content today!

From Pose to Playback: Mastering Video Generation with Tongyi Wanxiang's Fun-ControlNet

Tongyi Wanxiang-WAN2.1-Fun ControlNet Video Generation: Create dynamic videos with pose/depth control & style control. Learn how this workflow generates videos, controls content, and upscales resolution.

Related by Model

Transform Your Videos into Stylized Animations with Advanced AI Technology

Unlock Advanced Video Depth Control with Wan Model-Based Workflow

Unlock AI-powered video depth control with our Wan model-based workflow. Discover how to extract depth maps, stylize videos with text guidance, and more. Dive into the details now!

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

From Pose to Playback: Mastering Video Generation with Tongyi Wanxiang's Fun-ControlNet

Unlock Advanced Video Depth Control with Wan Model-Based Workflow

Unlock AI-powered video depth control with our Wan model-based workflow. Discover how to extract depth maps, stylize videos with text guidance, and more. Dive into the details now!

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

From Pose to Playback: Mastering Video Generation with Tongyi Wanxiang's Fun-ControlNet

Low VRAM Alternatives

Transform Your Videos into Stylized Animations with Advanced AI Technology

Transforming Static Images into Cinematic Explosions with Wan2.1

Unlock explosive video effects with Wan2.1! Transform static images into dynamic clips with fire, debris & more. Discover the ultimate workflow for high-dynamic explosion effect videos.

Mastering Video-to-Video Translation: A Deep Dive into Wan2.1 VACE Model and ComfyUI

From Pose to Playback: Mastering Video Generation with Tongyi Wanxiang's Fun-ControlNet

Looking for more Video workflows? Browse the Video hub for additional templates and guides.

Unleash the Power of WanVideo: Create Stunning Sticker Peeling Effect Videos

Unlock Cinematic Mastery: Ultra-HD Photography Workflow Revealed

Summary

Chapter

workflow:

CustomNodes:

GroundingDinoModelLoader (segm...

workflow

Unleash AI-Powered Video Character Redraw: Transforming Videos with Style

Workflow Overview

Required Models

Required Nodes

Setup Notes

1. Workflow Overview

2. Core Models

3. Key Nodes

4. Workflow Structure (Groups)

5. Inputs & Outputs

6. Notes

FAQ

What models does this workflow require?

How much VRAM is recommended?

Can this workflow be used commercially?

Which ComfyUI nodes are involved?

Related Workflows

Related by Use Case

Related by Model

Related by Node

Low VRAM Alternatives

Summary

Chapter