Create Breathtaking Architectural Videos with Our Advanced Low-Memory Solution
Generate long architectural animations with low VRAM! Discover a powerful workflow featuring FramePack, CLIP vision, and text prompts for motion control. Learn how to optimize memory and create stunning sequences with our step-by-step guide.
- VRAM: Low VRAM (≤8GB)
- Reading Time: 3 min
Setup Notes
- Install the required models before opening the workflow template.
- Recommended hardware: Low VRAM (≤8GB).
1. Workflow Overview

A low-VRAM animation generator for architectural scenes featuring:
- Long Sequences: 60s generation with 6GB VRAM via `FramePack`
- Memory Optimization: Tiled decoding + temporal slicing
- Multimodal Control: CLIP vision + text prompts for motion
- Prompt Assistant: Built-in template in the `Note` node
2. Core Models
| Model | Function | Source |
|---|---|---|
| `FramePackI2V_HY` | Video diffusion (BF16) | |
| `hunyuan_video_vae_bf16` | Lightweight VAE | Manual download |
| CLIP vision model | Visual encoder | Auto-installed |
3. Key Nodes
| Node | Purpose | Installation |
|---|---|---|
| | Frame-wise sampling | |
| `VAEDecodeTiled` | Memory-efficient decoding | Built-in (enable |
| | Video rendering | |
| | Smart resizing | ComfyUI Manager |
4. Pipeline Stages
Stage 1: Input Processing
- Image Input: Load via `LoadImage` (e.g., `work-04.jpg`)
- Resolution Matching: Auto-optimize with `FramePackFindNearestBucket`
- Feature Extraction: `CLIPVisionEncode` encodes visual cues
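
The resolution-matching step can be pictured as picking, from a fixed set of supported sizes, the one whose aspect ratio is closest to the input image. The sketch below illustrates that idea only: the bucket list and the function name `nearest_bucket` are hypothetical, and the actual `FramePackFindNearestBucket` node may use a different table or distance measure.

```python
# Hypothetical (height, width) buckets for illustration; the node's real table may differ.
BUCKETS = [
    (416, 960),
    (512, 768),
    (640, 640),
    (768, 512),
    (960, 416),
]


def nearest_bucket(height, width, buckets=BUCKETS):
    """Return the (height, width) bucket whose aspect ratio best matches the input."""
    # Cross-multiplying compares the two aspect ratios without dividing.
    return min(buckets, key=lambda b: abs(height * b[1] - width * b[0]))


if __name__ == "__main__":
    # A landscape 1080x1920 (height x width) image maps to a landscape bucket.
    print(nearest_bucket(1080, 1920))
```

Under these assumptions, a wide architectural render lands in a wide bucket and a square image in the square one, so the source image is resized rather than heavily cropped.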
Stage 2: Animation Core
- Sampling: 30 steps, CFG=10, UniPC-BH1 sampler
- VRAM optimizations: `teacache` (0.15) + `temporal_size=64`
- Motion Control: Text prompts (e.g., "slow zoom-in")
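
To give a feel for what a caching threshold such as `teacache` (0.15) controls, here is a simplified sketch of threshold-based step skipping: the relative change between consecutive step inputs is accumulated, and the model is only re-run once that total crosses the threshold. Reading 0.15 as this threshold is an assumption, the function names are hypothetical, and published TeaCache implementations additionally rescale the distance, which is omitted here.

```python
def relative_l1(current, previous):
    """Total absolute change, relative to the previous signal's total magnitude."""
    diff = sum(abs(c - p) for c, p in zip(current, previous))
    scale = sum(abs(p) for p in previous)
    return diff / scale if scale else float("inf")


def plan_steps(signals, threshold=0.15):
    """Return one bool per step: True = run the model, False = reuse the cached result."""
    decisions = []
    accumulated = 0.0
    previous = None
    last = len(signals) - 1
    for i, signal in enumerate(signals):
        if previous is None or i == last:
            compute = True  # the first and last steps are always computed
        else:
            accumulated += relative_l1(signal, previous)
            compute = accumulated >= threshold
        if compute:
            accumulated = 0.0  # start accumulating again after a real model call
        decisions.append(compute)
        previous = signal
    return decisions


if __name__ == "__main__":
    toy_inputs = [[1.0, 1.0], [1.05, 1.05], [1.10, 1.10], [1.30, 1.30], [1.31, 1.31]]
    print(plan_steps(toy_inputs))
```

In this toy model a higher threshold skips more steps, which is faster but risks losing fine detail; a lower one stays closer to full sampling.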
Stage 3: Output
- Tiled Decoding: 128x128 blocks via `VAEDecodeTiled`
- Video Export: MP4 output (30FPS, H.264)
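
Tiled decoding saves memory because only one block is decoded at a time. The sketch below shows just the tiling arithmetic for 128x128 blocks. It is not the `VAEDecodeTiled` implementation: `tile_boxes` is a hypothetical helper, it ignores the overlap and blending a real tiled decoder applies at tile seams, and whether the 128 refers to pixel or latent units is not stated in this guide.

```python
def tile_boxes(height, width, tile=128):
    """Yield (top, left, bottom, right) boxes that cover a height x width frame."""
    if tile <= 0:
        raise ValueError("tile must be positive")
    for top in range(0, height, tile):
        for left in range(0, width, tile):
            # Edge tiles are clipped so they never extend past the frame.
            yield top, left, min(top + tile, height), min(left + tile, width)


if __name__ == "__main__":
    boxes = list(tile_boxes(512, 512))
    print(len(boxes), boxes[0], boxes[-1])
```

Peak memory in this scheme scales with the tile size rather than the full frame, at the cost of more decode calls per frame.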
5. Inputs & Outputs
Required Inputs:
- Static architecture image
- Motion prompts (Chinese preferred)
Outputs:
- MP4 video (`FramePack_00001.mp4`)
- Resolution: 512x512 to 1024x1024
6. Critical Notes
VRAM Management:
- Reduce `total_second_length` (about 1GB of VRAM per 10s of video)
- Enable `gpu_memory_preservation` (default=6)
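
The 1GB-per-10s figure can be turned into a quick budget check before queuing a long render. The helper below is a rough sketch that assumes VRAM use grows linearly with clip length and has no fixed overhead, which is a simplification; `max_clip_seconds` is a hypothetical name and not part of the workflow.

```python
def max_clip_seconds(free_vram_gb, gb_per_10s=1.0):
    """Estimate the longest clip (in seconds) that fits in the given VRAM budget."""
    if free_vram_gb < 0 or gb_per_10s <= 0:
        raise ValueError("free_vram_gb must be >= 0 and gb_per_10s must be > 0")
    # Linear rule of thumb: every 10 seconds of video costs gb_per_10s gigabytes.
    return 10.0 * free_vram_gb / gb_per_10s


if __name__ == "__main__":
    print(max_clip_seconds(6))  # 6GB budget under the 1GB/10s rule
```

With a 6GB budget this yields 60 seconds, which matches the 60s-at-6GB figure quoted in the overview; treat it as a starting value for `total_second_length` rather than a guarantee.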
Dependencies:
- Download `FramePackI2V_HY` and `hunyuan_video_vae_bf16`
- CLIP models auto-download
Troubleshooting:
- CUDA OOM → Lower `latent_window_size` (default=9)
- Choppy video → Ensure `temporal_overlap` ≥ 8
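
Why a larger `temporal_overlap` helps with choppiness is easier to see from how a frame sequence is split into windows. The sketch below assumes frames are processed in chunks of `temporal_size` and that neighboring chunks share `temporal_overlap` frames so they can be blended; `temporal_windows` is a hypothetical helper, and the exact slicing inside the decoder may differ.

```python
def temporal_windows(num_frames, size=64, overlap=8):
    """Return (start, end) frame ranges; neighboring windows share `overlap` frames."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    if num_frames <= 0:
        return []
    windows = []
    start = 0
    while True:
        end = min(start + size, num_frames)
        windows.append((start, end))
        if end == num_frames:
            break
        # Step back by `overlap` so the next window re-covers the seam frames.
        start = end - overlap
    return windows


if __name__ == "__main__":
    print(temporal_windows(150))
```

With too little overlap there are few shared frames to blend across each seam, which is one plausible source of visible jumps between chunks.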