WAN 2.1 Text-to-Video 1.3B (480P) - Workflow

This workflow leverages WAN 2.1’s Text-to-Video 1.3B model to generate high-quality 480P videos from text prompts. It integrates advanced video generation techniques with ComfyUI’s flexible node-based workflow to create smooth and coherent animations from AI-generated frames.

🎯 Features

Text-to-Video Generation – Converts text descriptions into dynamic 480P videos.
Efficient VRAM Usage – Runs on consumer-grade GPUs with only 8.19GB VRAM required.
Temporal Consistency – WAN-VAE ensures stable motion and clear details across frames.

💡 Use Cases

🎬 Creative Content – Generate short animations for storytelling, marketing, or social media.
🎮 Game & Virtual Worlds – Prototype in-game cinematics and animated backgrounds.
📺 AI-Assisted Filmmaking – Experiment with AI-generated video concepts and scene planning.

File	Destination	Source
wan_2.1_vae.safetensors	/ComfyUI/models/vae	Download
umt5_xxl_fp8_e4m3fn_scaled.safetensors	/ComfyUI/models/text_encoders	Download
wan2.1_t2v_1.3B_bf16.safetensors	/ComfyUI/models/diffusion_models	Download

Run this workflow on InstaSD

Description

🎯 Features

💡 Use Cases

Models

Nodes