KaiyueSun98/T2I-Personalization-with-AR

☆47

Alternatives and similar repositories for T2I-Personalization-with-AR

Users who are interested in T2I-Personalization-with-AR are comparing it to the repositories listed below.

KaiyueSun98 / T2I-ReasonBench
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆38 | Sep 16, 2025 | Updated 10 months ago
Karine-Huang / GenMAC
[AAAI 2026] GenMAC for Compositional Text-to-Video Generation
☆35 | Jan 10, 2026 | Updated 6 months ago
TencentARC / GRPO-CARE
[ACL2026 Findings] GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning
☆83 | Jun 23, 2025 | Updated last year
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204 | Jan 7, 2026 | Updated 6 months ago
HKU-MMLab / OmniX
Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".
☆104 | Mar 31, 2026 | Updated 3 months ago
TencentARC / SEED-Bench-R1
☆100 | Jun 23, 2025 | Updated last year
SilentView / EMCID
Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"
☆19 | Mar 21, 2024 | Updated 2 years ago
qiulu66 / Anime-Shooter
☆58 | Jun 4, 2025 | Updated last year
gogoduan / GoT-R1
[ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
☆106 | Jan 27, 2026 | Updated 6 months ago
YuqingWang1029 / CubiD
[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…
☆63 | Apr 10, 2026 | Updated 3 months ago
Yukun-Huang / DreamCube
[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".
☆181 | Feb 4, 2026 | Updated 5 months ago
YuqingWang1029 / PAR
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆186 | Mar 20, 2025 | Updated last year
hutaiHang / ATM
[ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"
☆28 | Apr 15, 2025 | Updated last year
HKU-MMLab / Math-VR-CodePlot-CoT
Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
☆63 | Nov 4, 2025 | Updated 8 months ago
HKU-MMLab / UniClawBench
UniClawBench project page: https://uniclawbench.github.io/
☆38 | Updated this week
YuqingWang1029 / TokenBridge
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158 | Jul 24, 2025 | Updated last year
InternRobotics / OST-Bench
[NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
☆80 | Sep 29, 2025 | Updated 10 months ago
rongyaofang / prism-bench
This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…
☆131 | Jan 29, 2026 | Updated 6 months ago
rongyaofang / PUMA
Empowering Unified MLLM with Multi-granular Visual Generation
☆132 | Jan 16, 2025 | Updated last year
SilentView / LVD-2M
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
☆79 | Oct 15, 2024 | Updated last year
HKU-MMLab / Macro
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆67 | Mar 27, 2026 | Updated 4 months ago
HKU-MMLab / PhysForge
[ICML 2026] PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
☆161 | May 14, 2026 | Updated 2 months ago
lwq20020127 / OmniDrag
[IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
☆16 | Feb 13, 2026 | Updated 5 months ago
yhyang-myron / DreamComposer
[CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions
☆135 | Jul 22, 2024 | Updated 2 years ago
yosefdayani / MV-RAG
MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…
☆23 | Nov 29, 2025 | Updated 8 months ago
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18 | Aug 27, 2025 | Updated 11 months ago
HH-LG / AgeBooth
Official implementation of "AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models"
☆16 | Oct 8, 2025 | Updated 9 months ago
HVision-NKU / StyleExpert
Official implementation of StyleExpert (CVPR 2026)
☆39 | Mar 19, 2026 | Updated 4 months ago
TencentARC / Divot
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
☆87 | Feb 27, 2025 | Updated last year
KlingAIResearch / GameFactory
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
☆493 | Mar 22, 2025 | Updated last year
qiulu66 / EgoPlan-Bench2
☆31 | Apr 11, 2025 | Updated last year
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆180 | Oct 1, 2025 | Updated 9 months ago
HVision-NKU / ControlSR
☆13 | Apr 19, 2025 | Updated last year
KaiyueSun98 / T2V-CompBench
[CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
☆123 | Oct 25, 2025 | Updated 9 months ago
snap-research / VIMI
☆13 | Jul 10, 2024 | Updated 2 years ago
yigu1008 / Diffusion-RPO
☆15 | Mar 30, 2025 | Updated last year
CUC-MIPG / Edit-Transfer
Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"
☆89 | Jun 6, 2025 | Updated last year
BerserkerVV / Video2LoRA
Video2LoRA: Unified Semantic-Controlled Video Generation via Per-Reference-Video LoRA (CVPR 2026 Findings)
☆22 | May 25, 2026 | Updated 2 months ago
VAST-AI-Research / HoloPart
HoloPart: Generative 3D Part Amodal Segmentation
☆663 | Apr 11, 2025 | Updated last year