baaivision/MUSE-Pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/baaivision/MUSE-Pytorch)

baaivision / MUSE-Pytorch

An in-context conditioning version of MUSE with pre-trained checkpoints.

☆115

Alternatives and similar repositories for MUSE-Pytorch

Users that are interested in MUSE-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zideliu / StyleDrop-PyTorch
View on GitHub
Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)
☆588Aug 23, 2023Updated 2 years ago
aim-uofa / StyleDrop-PyTorch
View on GitHub
This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.
☆226Jul 11, 2023Updated 3 years ago
baaivision / CapsFusion
View on GitHub
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
☆215Feb 27, 2024Updated 2 years ago
donglixp / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆19Jan 3, 2023Updated 3 years ago
huggingface / amused
View on GitHub
☆89Jan 4, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lucidrains / muse-maskgit-pytorch
View on GitHub
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
☆918Feb 29, 2024Updated 2 years ago
baofff / U-ViT
View on GitHub
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
☆1,107Mar 25, 2023Updated 3 years ago
theAdamColton / ijepa-enhanced
View on GitHub
recipe for training fully-featured self supervised image jepa models
☆14Jun 4, 2025Updated last year
CASIA-LMC-Lab / Obj2Seq
View on GitHub
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85Nov 2, 2022Updated 3 years ago
Zhendong-Wang / Prompt-Diffusion
View on GitHub
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
☆414Mar 25, 2024Updated 2 years ago
ThomasMrY / VCT
View on GitHub
[NeurIPS 2022] code for "Visual Concepts Tokenization"
☆23Oct 10, 2022Updated 3 years ago
baaivision / vid2vid-zero
View on GitHub
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
☆356Jul 4, 2023Updated 3 years ago
csyxwei / ELITE
View on GitHub
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
☆541Jan 8, 2024Updated 2 years ago
thu-ml / unidiffuser
View on GitHub
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
☆1,486May 31, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
renwang435 / video-ttt-release
View on GitHub
Test-Time Training on Video Streams
☆70Jul 24, 2023Updated 2 years ago
vigilant-umbrella / wikiHowUnofficialAPI
View on GitHub
API to extract data from wikiHow
☆18Jul 10, 2021Updated 5 years ago
willisma / SiT
View on GitHub
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,188Dec 22, 2025Updated 7 months ago
sihyun-yu / PVDM
View on GitHub
[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space
☆322May 14, 2024Updated 2 years ago
Yuheng-Li / PACGen
View on GitHub
☆64Jul 1, 2023Updated 3 years ago
LeapLabTHU / Deep-Incubation
View on GitHub
Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)
☆92Mar 16, 2023Updated 3 years ago
google-research / maskgit
View on GitHub
Official Jax Implementation of MaskGIT
☆562Nov 18, 2022Updated 3 years ago
SHI-Labs / Versatile-Diffusion
View on GitHub
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,334Aug 10, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LTH14 / mage
View on GitHub
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
☆582Mar 10, 2023Updated 3 years ago
xiaofeng94 / VL-PLM
View on GitHub
Exploiting unlabeled data with vision and language models for object detection, ECCV 2022
☆97Jan 16, 2024Updated 2 years ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,166Mar 20, 2025Updated last year
shape-guided-diffusion / shape-guided-diffusion
View on GitHub
Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024
☆39Aug 19, 2023Updated 2 years ago
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago
zlab-princeton / UEval
View on GitHub
UEval: A Benchmark for Unified Multimodal Generation
☆24Apr 20, 2026Updated 3 months ago
LijieFan / LaCLIP
View on GitHub
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
☆291Jan 14, 2024Updated 2 years ago
aim-uofa / AutoStory
View on GitHub
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
☆149Mar 5, 2026Updated 4 months ago
facebookresearch / long_seq_mae
View on GitHub
code release of research paper "Exploring Long-Sequence Masked Autoencoders"
☆100Oct 14, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
hustvl / RILS
View on GitHub
[CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)
☆44Sep 5, 2023Updated 2 years ago
ai-forever / MoVQGAN
View on GitHub
MoVQGAN - model for the image encoding and reconstruction
☆266Oct 31, 2023Updated 2 years ago
Zeqiang-Lai / Mini-DALLE3
View on GitHub
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
☆313Dec 28, 2023Updated 2 years ago
shunk031 / training-free-structured-diffusion-guidance
View on GitHub
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…
☆120Mar 29, 2023Updated 3 years ago
baaivision / Emu
View on GitHub
Emu Series: Generative Multimodal Models from BAAI
☆1,776Jan 12, 2026Updated 6 months ago
causalfusion / causalfusion
View on GitHub
☆196Dec 17, 2024Updated last year
octoml / deformable-attention-kernel
View on GitHub
TVMScript kernel for deformable attention
☆25Dec 15, 2021Updated 4 years ago