THUDM / CogView4Links

CogView4, CogView3-Plus and CogView3(ECCV 2024)

☆1,076

Alternatives and similar repositories for CogView4

Users that are interested in CogView4 are comparing it to the libraries listed below

Sorting:

stepfun-ai / Step1X-Edit
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…
☆1,510Updated this week
IamCreateAI / Ruyi-Models
☆519Updated 5 months ago
RedAIGC / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆702Updated 7 months ago
TencentARC / BrushEdit
[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
☆571Updated 6 months ago
SkyworkAI / SkyReels-A1
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
☆540Updated last month
AIGText / Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…
☆597Updated last month
aigc-apps / EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
☆2,177Updated 4 months ago
jianzongwu / DiffSensei
Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
☆823Updated 5 months ago
bytedance / UNO
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
☆1,154Updated 2 months ago
Tencent-Hunyuan / HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
☆1,115Updated last month
Tencent-Hunyuan / InstantCharacter
☆1,009Updated 2 months ago
Phantom-video / Phantom
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
☆1,280Updated 2 weeks ago
alimama-creative / FLUX-Controlnet-Inpainting
☆737Updated 7 months ago
MeiGen-AI / MultiTalk
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
☆1,207Updated last week
showlab / PhotoDoodle
[ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"
☆410Updated 2 months ago
Xiaojiu-z / EasyControl
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)
☆1,594Updated this week
TencentARC / SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
☆855Updated 9 months ago
aigc-apps / VideoX-Fun
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
☆1,194Updated this week
ali-vilab / In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
☆1,946Updated 6 months ago
Picsart-AI-Research / StreamingT2V
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
☆1,579Updated 3 months ago
MyNiuuu / MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
☆751Updated 7 months ago
Alpha-VLLM / Lumina-Image-2.0
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
☆740Updated 2 weeks ago
cangcz / AnchorCrafter
☆573Updated last week
erwold / qwen2vl-flux
☆541Updated 7 months ago
stepfun-ai / Step-Video-TI2V
☆344Updated 3 months ago
donahowe / AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
☆445Updated 3 months ago
Open-Magic-Video / Magic-1-For-1
☆749Updated 4 months ago
SkyworkAI / SkyReels-V1
SkyReels V1: The first and most advanced open-source human-centric video foundation model
☆2,225Updated 4 months ago
Tencent-Hunyuan / HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
☆1,561Updated last month
modelscope / scepter
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
☆526Updated 3 months ago