SemanticScuttle - klotz.me » klotz: text-to-video

klotz: text-to-video*

Wan2.1-I2V-14B-720P: Open and Advanced Large-Scale Video Generative Models

The article introduces Wan2.1, a suite of open video foundation models for tasks such as text-to-video and image-to-video generation. It highlights state-of-the-art benchmark performance, support for consumer-grade GPUs, coverage of multiple tasks, and an efficient video VAE. The I2V-14B model, which generates 720P video, is noted for its strong results across benchmarks.

2025-02-25 Tags: wan2.1, video generation, text-to-video, image-to-video, vae, gai by klotz

HunyuanVideo: A Systematic Framework For Large Video Generation Model Training

HunyuanVideo is an open-source video generation model whose performance is comparable or superior to that of leading closed-source models. Its features include a unified image and video generative architecture, a large language model text encoder, and a causal 3D VAE for spatio-temporal compression.

2024-12-05 Tags: hunyuanvideo, text-to-video, llm, hugging face, tencent, machine learning by klotz
