Wan2.1_flf2v_720p_14b_fp16.safetensors
Currently, this model is likely supported by custom ComfyUI nodes or dedicated forks of diffusers . If you have the VRAM, the workflow generally looks like this:
Specifically, today we are looking at the file that has been popping up on Hugging Face and various model hubs: . wan2.1_flf2v_720p_14b_fp16.safetensors
This is a . It is significantly heavier than typical SDXL or standard video models. Currently, this model is likely supported by custom
The "FLF2V" in the filename stands for . Unlike standard text-to-video models that generate content from scratch, FLF2V allows users to provide both a starting image and an ending image. The model then intelligently "inpaints" the intermediate frames to create a logically coherent transition between the two. Core Specifications wan2.1_flf2v_720p_14b_fp16.safetensors