ali-vilab/TTS-VAR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ali-vilab/TTS-VAR)

ali-vilab / TTS-VAR

Test-time Scaling for VAR models

☆33

Alternatives and similar repositories for TTS-VAR

Users that are interested in TTS-VAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ali-vilab / DreamVideo-Omni
View on GitHub
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
☆16May 27, 2026Updated last month
ali-vilab / ProMoE
View on GitHub
[ICLR2026] The official code of "Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance"
☆46Mar 23, 2026Updated 4 months ago
ali-vilab / DreamRelation
View on GitHub
[ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"
☆27Feb 4, 2026Updated 5 months ago
qiuk2 / RobusTok
View on GitHub
Image Tokenizer Needs Post-Training
☆24Oct 4, 2025Updated 9 months ago
HKU-MMLab / UniClawBench
View on GitHub
UniClawBench project page: https://uniclawbench.github.io/
☆37Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
dingdongwang / MMSU
View on GitHub
[ICLR 2026] | MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark
☆17Feb 12, 2026Updated 5 months ago
viiika / Prism
View on GitHub
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆22Mar 4, 2026Updated 4 months ago
ali-vilab / DiffCamera
View on GitHub
[SIGGRAPH Asia 2025] DiffCamera: Arbitrary Refocusing on Images
☆16Jan 26, 2026Updated 5 months ago
yizhou42 / MfH
View on GitHub
☆19May 15, 2026Updated 2 months ago
shim0114 / T2V-Diffusion-Search
View on GitHub
[NeurIPS 2025] Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search
☆18Feb 24, 2026Updated 5 months ago
aimagelab / VHS
View on GitHub
[CVPR2026 Findings] VHS: Verifier on Hidden States, an efficient inference-time scaling verification framework for DiT-based image genera…
☆16Mar 25, 2026Updated 4 months ago
HKU-MMLab / Macro
View on GitHub
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆67Mar 27, 2026Updated 3 months ago
TAU-VAILab / BlendedPC
View on GitHub
Blended Point Cloud Diffusion for Localized Text-guided Shape Editing
☆17Jul 2, 2026Updated 3 weeks ago
shawnricecake / draft-attention
View on GitHub
Code for Draft Attention
☆103May 22, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
qishisuren123 / S2L-PO
View on GitHub
[ICML 2026] Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO
☆19Jun 15, 2026Updated last month
JIA-Lab-research / DreamOmni3
View on GitHub
This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''
☆40Dec 30, 2025Updated 6 months ago
tianzhuotao / CAC
View on GitHub
☆32Mar 24, 2023Updated 3 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
JIA-Lab-research / VisionThink
View on GitHub
[NeurIPS 2025] Efficient Reasoning Vision Language Models
☆460Sep 18, 2025Updated 10 months ago
ByteVisionLab / DetailFlow
View on GitHub
🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆170Jul 10, 2025Updated last year
tiagofrepereira2012 / gradients_without_backpropagation
View on GitHub
☆12Feb 23, 2022Updated 4 years ago
Jumpat / tigon
View on GitHub
Official repository of Text-Image Conditioned 3D Generation (TIGON, CVPR 2026)
☆27Updated this week
whiteinblue / EarthCrafter
View on GitHub
☆40Mar 17, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
YujiaHu1109 / IEAP
View on GitHub
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
☆118Sep 27, 2025Updated 9 months ago
FPSG-UIUC / micro23-teaal-artifact
View on GitHub
MICRO 2023 Evaluation Artifact for TeAAL
☆11Oct 26, 2023Updated 2 years ago
thunderbolt215 / UniPercept
View on GitHub
[ICML2026 Spotlight] UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
☆157Jul 13, 2026Updated last week
HorizonWind2004 / reconstruction-alignment
View on GitHub
[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…
☆411May 23, 2026Updated 2 months ago
dc-ai-projects / DC-AR
View on GitHub
☆83Oct 18, 2025Updated 9 months ago
jinxiang-liu / anno-free-AVS
View on GitHub
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
☆38Oct 11, 2024Updated last year
KaiyueSun98 / T2I-ReasonBench
View on GitHub
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆37Sep 16, 2025Updated 10 months ago
ByteVisionLab / NextFlow
View on GitHub
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
☆331Jan 9, 2026Updated 6 months ago
TIGER-AI-Lab / VISTA
View on GitHub
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20Feb 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YuqingWang1029 / CubiD
View on GitHub
[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…
☆63Apr 10, 2026Updated 3 months ago
Zechao-Guan / TopoDiT-3D
View on GitHub
☆15May 13, 2025Updated last year
MajorDavidZhang / Generalization_unified_VLM
View on GitHub
☆24May 23, 2025Updated last year
ant-research / M2-Miner
View on GitHub
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55Apr 22, 2026Updated 3 months ago
GAIR-NLP / Med
View on GitHub
[ICML 2026] What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-…
☆22May 15, 2026Updated 2 months ago
qishisuren123 / AnyCap
View on GitHub
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆54Jul 24, 2025Updated last year
ali-vilab / alitok
View on GitHub
[ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
☆56Oct 12, 2025Updated 9 months ago