Karine-Huang/GenMAC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Karine-Huang/GenMAC)

Karine-Huang / GenMAC

[AAAI 2026] GenMAC for Compositional Text-to-Video Generation

☆35

Alternatives and similar repositories for GenMAC

Users that are interested in GenMAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KaiyueSun98 / T2I-ReasonBench
View on GitHub
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆38Sep 16, 2025Updated 10 months ago
KaiyueSun98 / T2I-Personalization-with-AR
View on GitHub
☆47Apr 20, 2025Updated last year
qiulu66 / Anime-Shooter
View on GitHub
☆56Jun 4, 2025Updated last year
HKU-MMLab / UniClawBench
View on GitHub
UniClawBench project page: https://uniclawbench.github.io/
☆37Updated this week
TencentARC / GRPO-CARE
View on GitHub
[ACL2026 Findings] GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning
☆83Jun 23, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KaiyueSun98 / T2V-CompBench
View on GitHub
[CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
☆123Oct 25, 2025Updated 9 months ago
gogoduan / GoT-R1
View on GitHub
[ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
☆106Jan 27, 2026Updated 6 months ago
SilentView / EMCID
View on GitHub
Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"
☆19Mar 21, 2024Updated 2 years ago
HKU-MMLab / Math-VR-CodePlot-CoT
View on GitHub
Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
☆63Nov 4, 2025Updated 8 months ago
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
YuqingWang1029 / TokenBridge
View on GitHub
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158Jul 24, 2025Updated last year
InternRobotics / OST-Bench
View on GitHub
[NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
☆80Sep 29, 2025Updated 9 months ago
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆345May 7, 2026Updated 2 months ago
qiulu66 / EgoPlan-Bench2
View on GitHub
☆31Apr 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Yukun-Huang / DreamCube
View on GitHub
[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".
☆181Feb 4, 2026Updated 5 months ago
rongyaofang / prism-bench
View on GitHub
This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…
☆131Jan 29, 2026Updated 5 months ago
rongyaofang / PUMA
View on GitHub
Empowering Unified MLLM with Multi-granular Visual Generation
☆132Jan 16, 2025Updated last year
HKU-MMLab / Macro
View on GitHub
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆67Mar 27, 2026Updated 4 months ago
TencentARC / Moto
View on GitHub
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆180Oct 1, 2025Updated 9 months ago
SilentView / LVD-2M
View on GitHub
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
☆79Oct 15, 2024Updated last year
yhyang-myron / DreamComposer
View on GitHub
[CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions
☆135Jul 22, 2024Updated 2 years ago
traveler-framework / TraveLER
View on GitHub
[EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering
☆18Oct 31, 2024Updated last year
vivo / DiMo-GUI
View on GitHub
[EMNLP 2025]Repository for paper "DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning"
☆30Jul 2, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
SEU-VIPGroup / Understanding_Vision_Tasks
View on GitHub
☆13Feb 2, 2025Updated last year
CIntellifusion / MultiWorld
View on GitHub
Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models
☆247May 12, 2026Updated 2 months ago
KlingAIResearch / GameFactory
View on GitHub
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
☆492Mar 22, 2025Updated last year
HKU-MMLab / EVATok
View on GitHub
[CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"
☆61Mar 13, 2026Updated 4 months ago
MaureenZOU / detectron2-xyz
View on GitHub
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
☆19May 7, 2022Updated 4 years ago
TencentARC / Divot
View on GitHub
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
☆87Feb 27, 2025Updated last year
zhenyuw16 / GenArtist
View on GitHub
Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"
☆168Oct 23, 2024Updated last year
MCG-NJU / DMM
View on GitHub
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
☆47Apr 27, 2025Updated last year
weixi-feng / TC-Bench
View on GitHub
☆27Jun 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KlingAIResearch / RoboMaster
View on GitHub
[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
☆107Feb 8, 2026Updated 5 months ago
MICV-yonsei / STORM
View on GitHub
[CVPR 2025] Official Pytorch Code for Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synth…
☆15Jun 21, 2025Updated last year
yuexy / ST-AR
View on GitHub
☆14Sep 22, 2025Updated 10 months ago
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆29Mar 15, 2026Updated 4 months ago
Pointcept / SAMPart3D
View on GitHub
SAMPart3D: Segment Any Part in 3D Objects
☆564May 4, 2025Updated last year
technion-cs-nlp / ReFACT
View on GitHub
☆13Apr 3, 2024Updated 2 years ago
yayafengzi / ALToLLM
View on GitHub
ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
☆30May 27, 2025Updated last year