☆88Jan 4, 2024Updated 2 years ago
Alternatives and similar repositories for amused
Users that are interested in amused are comparing it to the libraries listed below
Sorting:
- Open reproduction of MUSE for fast text2image generation.☆359Jun 1, 2024Updated last year
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆116Jun 4, 2023Updated 2 years ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- Consistency Distilled Diff VAE☆2,209Nov 7, 2023Updated 2 years ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆102Oct 6, 2023Updated 2 years ago
- i-mae Pytorch Repo☆20Apr 6, 2024Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42May 24, 2023Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆312Nov 1, 2024Updated last year
- ☆12Aug 30, 2022Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆145Feb 11, 2025Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆763Jan 26, 2024Updated 2 years ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆603Oct 6, 2024Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- [NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop☆14Apr 13, 2023Updated 2 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- ☆11Mar 19, 2024Updated last year
- [ICCV 2023] Online Clustered Codebook☆183Sep 19, 2024Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆638Oct 16, 2025Updated 4 months ago
- ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2☆112Jul 31, 2024Updated last year
- Reproduction of the first step in the text-to-video model Phenaki. Code and model weights for the Transformer-based autoencoder for video…☆29Aug 4, 2023Updated 2 years ago
- ☆95Jul 24, 2025Updated 7 months ago
- Udacity Deep Learning☆11Mar 11, 2016Updated 9 years ago
- A collection of resources and papers on diffusion models of video generation.☆10Feb 11, 2023Updated 3 years ago
- ☆12Feb 22, 2024Updated 2 years ago
- Notebooks to demonstrate TimmWrapper☆16Jan 16, 2025Updated last year
- [ICML 2025] Diff-MoE: Diffusion Transformer with Time-Aware and Space-Adaptive Experts☆26Nov 10, 2025Updated 3 months ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Android project that is using FastSAM model for segment anything with live camera feed and gallery images.☆11Nov 9, 2025Updated 3 months ago
- MoVQGAN - model for the image encoding and reconstruction☆260Oct 31, 2023Updated 2 years ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆39Feb 17, 2025Updated last year
- BindDiffusion: One Diffusion Model to Bind Them All☆164May 19, 2023Updated 2 years ago
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- GroupViT: Semantic Segmentation Emerges from Text Supervision☆25Dec 15, 2022Updated 3 years ago
- ☆11Oct 4, 2023Updated 2 years ago
- This repo consists of code for plotting top loss images☆13May 18, 2020Updated 5 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated 2 years ago