huggingface/amused

☆89

Alternatives and similar repositories for amused

Users that are interested in amused are comparing it to the libraries listed below.

huggingface / open-muse
Open reproduction of MUSE for fast text2image generation.
☆358 · Jun 1, 2024 · Updated 2 years ago
baaivision / MUSE-Pytorch
An in-context conditioning version of MUSE with pre-trained checkpoints.
☆115 · Jun 4, 2023 · Updated 3 years ago
zhangjiewu / awesome-t2i-eval
A curated list of papers and resources for text-to-image evaluation.
☆30 · Sep 6, 2023 · Updated 2 years ago
xie-lab-ml / Meissonic-Inference
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16 · Nov 21, 2024 · Updated last year
Owen718 / LongPrompt-LLamaGen
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30 · Oct 21, 2024 · Updated last year
nipunjindal / diffusers-layout-guidance
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".
☆42 · May 24, 2023 · Updated 3 years ago
ariG23498 / timm-wrapper-examples
Notebooks to demonstrate TimmWrapper
☆17 · Jan 16, 2025 · Updated last year
openai / consistencydecoder
Consistency Distilled Diff VAE
☆2,213 · Nov 7, 2023 · Updated 2 years ago
lyn-rgb / FreeU_Diffusers
"FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers
☆103 · Oct 6, 2023 · Updated 2 years ago
xvjiarui / GroupViT
GroupViT: Semantic Segmentation Emerges from Text Supervision
☆25 · Dec 15, 2022 · Updated 3 years ago
ShivamDuggal4 / adaptive-length-tokenizer
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?
☆146 · Feb 11, 2025 · Updated last year
Alpha-VLLM / Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…
☆646 · Oct 16, 2025 · Updated 9 months ago
yuval-alaluf / Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771 · Jan 26, 2024 · Updated 2 years ago
jy0205 / LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
☆603 · Oct 6, 2024 · Updated last year
fusiming3 / MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆86 · Jul 16, 2024 · Updated 2 years ago
valeoai / Halton-MaskGIT
[ICLR2025] Halton Scheduler for Masked Generative Image Transformer
☆286 · Oct 28, 2025 · Updated 8 months ago
VILA-Lab / i-mae
i-mae Pytorch Repo
☆20 · Apr 6, 2024 · Updated 2 years ago
baaivision / CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
☆215 · Feb 27, 2024 · Updated 2 years ago
zwx8981 / PerceptualAttack_BIQA
[NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop
☆13 · Apr 13, 2023 · Updated 3 years ago
AILab-CVC / FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆429 · Aug 25, 2025 · Updated 10 months ago
Davinci-XLab / V2Flow
☆19 · Apr 1, 2025 · Updated last year
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018 · Nov 25, 2025 · Updated 7 months ago
mihirp1998 / AlignProp
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆324 · Nov 1, 2024 · Updated last year
eclipse-t2i / lambda-eclipse-inference
[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…
☆53 · Nov 29, 2024 · Updated last year
tgxs002 / HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆677 · May 24, 2024 · Updated 2 years ago
Qrange-group / SUR-adapter
ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…
☆120 · Sep 4, 2025 · Updated 10 months ago
snap-research / 3dgp
3D generation on ImageNet [ICLR 2023]
☆213 · May 23, 2023 · Updated 3 years ago
obvious-research / phenaki-cvivit
Reproduction of the first step in the text-to-video model Phenaki. Code and model weights for the Transformer-based autoencoder for video…
☆29 · Aug 4, 2023 · Updated 2 years ago
lyndonzheng / CVQ-VAE
[ICCV 2023] Online Clustered Codebook
☆189 · Sep 19, 2024 · Updated last year
ritwikraha / Introduction-to-Image-Processing
This repository is for anyone who is new to image processing and is super excited by the topic.
☆16 · Dec 30, 2024 · Updated last year
HighCWu / control-lora-v2
ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2
☆111 · Jul 31, 2024 · Updated last year
PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299 · Oct 31, 2024 · Updated last year
eclipse-t2i / eclipse-inference
[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"
☆65 · May 1, 2024 · Updated 2 years ago
xie-lab-ml / IV-mixed-Sampler
[ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
☆39 · Feb 17, 2025 · Updated last year
bytedance / 1d-tokenizer
This repo contains the code for 1D tokenizer and generator
☆1,166 · Mar 20, 2025 · Updated last year
markweberdev / maskbit
Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"
☆94 · Apr 10, 2025 · Updated last year
thu-ml / unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
☆1,486 · May 31, 2023 · Updated 3 years ago
sihyun-yu / REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,679 · Mar 16, 2025 · Updated last year
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960 · Aug 15, 2024 · Updated last year