Boost Your Video Creation with Wan2.1, RIFE, and CR Upscaling

CN
ComfyUI.org
2025-04-14 09:02:24

Unlock powerful Image-to-Video generation with Wan2.1 model and ComfyUI. Discover how to boost FPS and resolution with RIFE interpolation and CR upscaling. Learn more and start creating!

Use Case
Video
Best For
Video
Key Nodes
Upscaler
VRAM
Medium VRAM (12–16GB)
Reading Time
4 min
View Required ModelsMore Video Workflows

Workflow Overview

Unlock powerful Image-to-Video generation with Wan2.1 model and ComfyUI. Discover how to boost FPS and resolution with RIFE interpolation and CR upscaling. Learn more and start creating!

Content type: Workflow

Primary intent: Download

Required Models

  • Wan2.1
  • Lora

Required Nodes

  • Upscaler

Setup Notes

  • Install the required models before opening the workflow template.
  • Recommended hardware: Medium VRAM (12–16GB).

1. Workflow Overview

m9guhhdpaohq204qfq7ca3139645b39c20a8c773ee3a503a5833dbc8cddb672b39abc7c5cae5a85f6f.gif

This workflow leverages Wan2.1 model for Image-to-Video (I2V) generation, with:

  1. Input: Single image + text prompt → Low-res video generation

  2. RIFE interpolation + CR upscaling for higher FPS (32fps) and resolution

  3. KJ acceleration (TeaCache/BlockSwap) to optimize VRAM usage

  4. Outputs: Original (16fps) and enhanced videos (32fps)

2. Core Models

Model Name

Function

Source

Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors

Main I2V model

Manual download (e.g., HuggingFace)

umt5-xxl-enc-bf16.safetensors

T5 text encoder

Required companion model

R-ESRGAN_4x+ Anime6B

Super-resolution upscaler

Install via ComfyUI Manager

rife47.pth

Frame interpolation (RIFE)

Manual GitHub install

3. Key Nodes

Node

Purpose

Installation

WanVideoSampler

Controls video sampling (UniPC)

Built-in

WanVideoLoraSelect

Loads LoRA for style tuning

Requires Wan plugin

RIFE VFI

Frame interpolation (32fps output)

Install ComfyUI-Frame-Interpolation

CR Upscale Image

3x resolution upscaling

Install ComfyUI-ESRGAN

4. Workflow Groups

  • Group 1: Wan2.1 Model Loading

    • Input: Model files, VAE, LoRA

    • Output: Initialized video generation model

  • Group 2: Text & Image Encoding

    • Input: Prompt (e.g., "1girl, golden hair"), negative prompt, uploaded image

    • Output: CLIP image embeds + T5 text embeds

  • Group 3: Acceleration Nodes

    • Critical params: BlockSwap=20 (VRAM safety), TeaCache=0.04 (speed boost)

  • Group 4: Initial Video Synthesis

    • Output: 480P raw video (16fps)

  • Group 5: Upscale + Interpolation

    • Pipeline: CR upscale → RIFE interpolation → 1080P output (32fps)

5. Inputs & Outputs

  • Input Parameters:

    • Required: Image path, Prompt, Seed (41387343190862)

    • Optional: LoRA model, RIFE multiplier (10)

  • Output:

    • teacache_00002.mp4 (raw video)

    • xiao_00001.mp4 (HD interpolated video)

6. Notes

  1. VRAM: Recommended ≥12GB GPU (e.g., RTX 3060+). Enable BlockSwap to reduce usage.

  2. Compatibility: Wan models must be bf16/fp8 format.

  3. Troubleshooting: If umt5-xxl-enc-bf16.safetensors is missing, download from HuggingFace to ComfyUI/models/wan_video.

  4. Optimization: Adjust TeaCache (0.01~0.05) for speed/quality trade-off.

FAQ