Wayfarer-Labs / owl-vaes
Weird autoencoder experiments
☆22 Updated this week
Alternatives and similar repositories for owl-vaes
Users interested in owl-vaes are comparing it to the libraries listed below.
- ☆156 Updated 2 weeks ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆155 Updated 9 months ago
- ☆151 Updated 7 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆258 Updated 5 months ago
- Focused on fast experimentation and simplicity ☆75 Updated 10 months ago
- Official PyTorch implementation of TokenSet. ☆126 Updated 7 months ago
- Implementation of the proposed MaskBit from Bytedance AI ☆82 Updated 11 months ago
- PyTorch implementation of Shortcut Models [Frans, 2025] with little modification ☆58 Updated 3 months ago
- ☆90 Updated last year
- Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment ☆61 Updated 4 months ago
- [Preprint] UCGM: Unified Continuous Generative Models ☆169 Updated 5 months ago
- Official implementation of Inductive Moment Matching ☆561 Updated 3 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance ☆155 Updated 8 months ago
- [ICCV 2025] Official Implementation of Contrastive Flow Matching ☆134 Updated 4 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆135 Updated 8 months ago
- ☆78 Updated 5 months ago
- ☆22 Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic… ☆86 Updated 10 months ago
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow) ☆142 Updated 2 weeks ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation" ☆181 Updated last year
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆130 Updated 7 months ago
- Clarity: A Minimalist Website Template for AI Research ☆156 Updated 9 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆156 Updated 4 months ago
- ☆27 Updated last year
- ☆289 Updated 10 months ago
- Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers. ☆131 Updated 5 months ago
- ☆183 Updated 10 months ago
- Implementation of a multimodal diffusion transformer in Pytorch ☆106 Updated last year
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo… ☆313 Updated 4 months ago
- ElasticTok: Adaptive Tokenization for Image and Video ☆82 Updated 11 months ago