Davinci-XLab/STAR-T2I

Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"

☆45

Alternatives and similar repositories for STAR-T2I

Users who are interested in STAR-T2I are comparing it to the repositories listed below.

Davinci-XLab / V2Flow
☆19 · Apr 1, 2025 · Updated last year
Mowenyii / Uniform-Attention-Maps
[WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing
☆17 · Mar 16, 2025 · Updated last year
krennic999 / STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
☆150 · Feb 19, 2025 · Updated last year
maxin-cn / Awesome-Autoregressive-Visual-Generation-Models
A collection of awesome autoregressive visual generation models
☆82 · Apr 17, 2025 · Updated last year
zhangguiwei610 / V2Flow
☆29 · Mar 30, 2025 · Updated last year
Mowenyii / PAE
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
☆87 · Jul 13, 2024 · Updated 2 years ago
taki0112 / pseudo_var
Pseudo code for "VAR: Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
☆15 · May 27, 2024 · Updated 2 years ago
RedShift51 / fast-latent-decoders
Toward Lightweight and Fast Decoders for Latent Diffusion Models in Image and Video Generation
☆22 · Dec 26, 2024 · Updated last year
BarretBa / ICTHP
Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation
☆45 · Aug 5, 2025 · Updated 11 months ago
QY-H00 / Conceptrol
Conceptrol: Concept Control of Zero-shot Personalized Image Generation
☆47 · Mar 27, 2025 · Updated last year
zhengchen1999 / DCT
☆10 · Aug 29, 2024 · Updated last year
Pengchengpcx / FTEdit
[CVPR 2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
☆25 · Aug 23, 2025 · Updated 11 months ago
lxa9867 / ImageFolder
High-performance Image Tokenizers for VAR and AR
☆307 · Apr 25, 2025 · Updated last year
tyshiwo1 / Accelerating-T2I-AR-with-SJD
[ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
☆52 · Apr 21, 2025 · Updated last year
krennic999 / MPI
Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].
☆35 · Nov 20, 2024 · Updated last year
MegaScenes / web-viewer
Web viewer for 3D reconstructions
☆22 · Mar 6, 2025 · Updated last year
ByteVisionLab / TokenFlow
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
☆464 · Aug 8, 2025 · Updated 11 months ago
zhangguiwei610 / CAMEL
Official implementation for the CVPR 2024 paper CAMEL
☆20 · Jun 20, 2024 · Updated 2 years ago
csguoh / FastVAR
[ICCV 2025] Generate one 2K image on single 24GB 3090 GPU!
☆88 · Sep 8, 2025 · Updated 10 months ago
Tencent-Hunyuan / iFSQ
iFSQ & LlamaGen-REPA
☆102 · Jan 27, 2026 · Updated 5 months ago
OliverRensu / FlowAR
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…
☆171 · May 1, 2025 · Updated last year
Xiangtaokong / NSARM
NSARM: Next-Scale Autoregressive Modeling for Robust Real-World Image Super-Resolution
☆27 · Oct 17, 2025 · Updated 9 months ago
zrealli / LDGen
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
☆38 · Mar 3, 2025 · Updated last year
shentt67 / SecureReID
[TIFS 2024] SecureReID: Privacy-Preserving Anonymization for Person Re-Identification
☆19 · Mar 9, 2024 · Updated 2 years ago
ali-vilab / alitok
[ICLR 2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
☆56 · Oct 12, 2025 · Updated 9 months ago
wz0919 / EPiC
[ICML 2026] Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
☆50 · Jun 2, 2025 · Updated last year
lian700 / SoliReward
Official Code for "SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models" [CVPR2…
☆21 · Jul 13, 2026 · Updated last week
MiracleDance / CAR
CAR: Controllable AutoRegressive Modeling for Visual Generation
☆129 · Nov 29, 2024 · Updated last year
w1oves / hqclip
[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets
☆67 · Aug 6, 2025 · Updated 11 months ago
bcmi / Granular-GRPO
[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆64 · Jun 1, 2026 · Updated last month
domyounglee / tf2-reformer
Reproducing the reformer with tf2
☆16 · Mar 6, 2021 · Updated 5 years ago
maple-research-lab / TPDM
Implementation of "Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation" [CVPR 2025]
☆43 · Apr 25, 2025 · Updated last year
NJU-PCALab / UltraHR-100k
This is the official repository of UltraHR-100K.
☆45 · Nov 21, 2025 · Updated 8 months ago
ziqipang / RandAR
[CVPR 2025 (Oral)] Open implementation of "RandAR"
☆208 · Jul 14, 2025 · Updated last year
LTH14 / mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,942 · Feb 20, 2026 · Updated 5 months ago
taki0112 / denoising-diffusion-gan-Tensorflow
Tensorflow implementation of "Tackling the Generative Learning Trilemma with Denoising Diffusion GANs" (ICLR 2022 Spotlight)
☆21 · Aug 3, 2022 · Updated 3 years ago
shunk031 / training-free-structured-diffusion-guidance
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…
☆120 · Mar 29, 2023 · Updated 3 years ago
xudonmao / PairEdit
☆26 · Nov 25, 2025 · Updated 7 months ago
ByteVisionLab / NextFlow
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
☆331 · Jan 9, 2026 · Updated 6 months ago