yejy53 / EchoLinks

☆115

Alternatives and similar repositories for Echo

Users that are interested in Echo are comparing it to the libraries listed below

Sorting:

TencentARC / MindOmni
☆125Updated 3 months ago
aniki-ly / FreeLong
[NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…
☆56Updated 2 months ago
CodeGoat24 / Pref-GRPO
Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
☆160Updated last week
hqhQAQ / PatchDPO
[CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
☆40Updated 2 months ago
csuhan / Tar
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆165Updated 2 weeks ago
CodeGoat24 / LiFT
Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.
☆83Updated 4 months ago
YuqingWang1029 / PAR
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆171Updated 5 months ago
QianWangX / EditCLIP
Implementation of paper EditCLIP: Representation Learning for Image Editing (ICCV 2025)
☆26Updated 2 months ago
viiika / HumanEdit
[CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…
☆34Updated 4 months ago
KaiyueSun98 / T2V-CompBench
[CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
☆91Updated 3 months ago
fusiming3 / MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆85Updated last year
Eureka-Maggie / MIGE
Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
☆69Updated 2 months ago
NJU-PCALab / InstanceCap
[CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍
☆47Updated 2 months ago
luping-liu / LongAlign
The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)
☆79Updated 4 months ago
Litalby1 / make-it-count
Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)
☆88Updated 6 months ago
yuriYanZeXuan / EEdit
(ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
☆52Updated this week
gogoduan / GoT-R1
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
☆96Updated 3 months ago
illume-unified-mllm / ILLUME_plus
☆119Updated 3 weeks ago
weichow23 / AnySD
Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>
☆28Updated 2 months ago
Osilly / Interleaving-Reasoning-Generation
This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark perform…
☆28Updated last week
SilentView / LVD-2M
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
☆69Updated 11 months ago
KwaiVGI / DiffMoE
PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
☆129Updated 4 months ago
TempleX98 / EasyRef
[ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
☆65Updated 2 months ago
Gen-Verse / HermesFlow
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
☆64Updated 7 months ago
HVision-NKU / K-LoRA
Official code for K-LoRA (CVPR 2025)
☆123Updated 3 months ago
mayuelala / FollowYourShape
[ArXiv 2025] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory…
☆48Updated last month
PKU-YuanGroup / UAE
Official repository for the UAE paper, unified-GRPO, and unified-Bench
☆69Updated this week
Mowenyii / PAE
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
☆80Updated last year
ThisisBillhe / NAR
The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"
☆57Updated 5 months ago
chenllliang / DreamEngine
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆119Updated 6 months ago