UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.
☆134Apr 2, 2025Updated 11 months ago
Alternatives and similar repositories for unidisc
Users that are interested in unidisc are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆100Feb 4, 2026Updated last month
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆121Mar 4, 2025Updated last year
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Apr 27, 2025Updated 10 months ago
- ☆46Nov 20, 2025Updated 3 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 9 months ago
- ☆47Apr 20, 2025Updated 10 months ago
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆95Nov 13, 2025Updated 3 months ago
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆355Mar 20, 2025Updated 11 months ago
- ☆190Dec 17, 2024Updated last year
- ☆34Mar 18, 2025Updated 11 months ago
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆963Jul 10, 2025Updated 7 months ago
- VideoAuteur: Towards Long Narrative Video Generation☆43Oct 22, 2025Updated 4 months ago
- The Superposition of Diffusion Models Using the Itô Density Estimator☆52Mar 20, 2025Updated 11 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- Pytorch implementation of a Variational Autoencoder (VAE) that learns from the MNIST dataset and generates images of altered handwritten …☆20Jan 10, 2023Updated 3 years ago
- Official implementation of Inductive Moment Matching☆574Jul 11, 2025Updated 7 months ago
- ☆63Jul 11, 2025Updated 7 months ago
- ☆28Mar 4, 2025Updated 11 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆171Feb 18, 2025Updated last year
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆44Jun 27, 2025Updated 8 months ago
- ☆163Jan 6, 2025Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,117Mar 20, 2025Updated 11 months ago
- A comprehensive codebase for training and finetuning Image <> Latent models.☆50Mar 1, 2025Updated last year
- Official implementation of "Perturbed-Attention Guidance"☆60Jul 2, 2024Updated last year
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Apr 10, 2025Updated 10 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 7 months ago
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆121Jan 2, 2025Updated last year
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆650Oct 16, 2024Updated last year
- ☆39Apr 27, 2024Updated last year
- LoRA for convolution layer☆22Mar 9, 2023Updated 2 years ago
- This package introduces a perceptual loss implementation based on the modern ConvNeXt architecture.☆27Nov 14, 2024Updated last year
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Sep 13, 2024Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆145Feb 11, 2025Updated last year
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆130Oct 18, 2024Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆136Dec 21, 2024Updated last year
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆392Jan 19, 2025Updated last year
- Official Implementation of "Instance Segmentation of Scene Sketches Using Natural Image Priors" (SIGGRAPH 2025)☆87Sep 10, 2025Updated 5 months ago