Wayfarer-Labs / owl-vaesLinks

Weird autoencoder experiments

☆22

Alternatives and similar repositories for owl-vaes

Users that are interested in owl-vaes are comparing it to the libraries listed below

Sorting:

CompVis / tread
☆156Updated 2 weeks ago
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆155Updated 9 months ago
cfifty / rotation_trick
☆151Updated 7 months ago
apple / ml-flextok
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆258Updated 5 months ago
fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆75Updated 10 months ago
Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆126Updated 7 months ago
lucidrains / maskbit-pytorch
Implementation of the proposed MaskBit from Bytedance AI
☆82Updated 11 months ago
yukara-ikemiya / modified-shortcut-models-pytorch
PyTorch implementation of Shortcut Models [Frans, 2025] with little modification
☆58Updated 3 months ago
huggingface / fineVideo
☆90Updated last year
yhli123 / Immiscible-Diffusion
Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment
☆61Updated 4 months ago
LINs-lab / UCGM
[Preprint] UCGM: Unified Continuous Generative Models
☆169Updated 5 months ago
lumalabs / imm
Official implementation of Inductive Moment Matching
☆561Updated 3 months ago
tzco / Diffusion-wo-CFG
Official Implementation for Diffusion Models Without Classifier-free Guidance
☆155Updated 8 months ago
gstoica27 / DeltaFM
[ICCV 2025] Official Implementation of Contrastive Flow Matching
☆134Updated 4 months ago
ShivamDuggal4 / adaptive-length-tokenizer
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
☆135Updated 8 months ago
EvolvingLMMs-Lab / Aero-1
☆78Updated 5 months ago
cloneofsimo / minDinoV2
☆22Updated last year
facebookresearch / EvalGIM
🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…
☆86Updated 10 months ago
Lakonik / GMFlow
[ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)
☆142Updated 2 weeks ago
lucidrains / titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
☆181Updated last year
alexanderswerdlow / unidisc
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆130Updated 7 months ago
lorenmt / clarity-template
Clarity: A Minimalist Website Template for AI Research
☆156Updated 9 months ago
zelaki / eqvae
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆156Updated 4 months ago
cloneofsimo / min-max-in-dit
☆27Updated last year
apple / ml-tarflow
☆289Updated 10 months ago
sayakpaul / nanoDiT
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
☆131Updated 5 months ago
causalfusion / causalfusion
☆183Updated 10 months ago
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆106Updated last year
qihao067 / CrossFlow
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆313Updated 4 months ago
LargeWorldModel / ElasticTok
ElasticTok: Adaptive Tokenization for Image and Video
☆82Updated 11 months ago