facebookresearch / capiLinks
Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"
☆106Updated last month
Alternatives and similar repositories for capi
Users that are interested in capi are comparing it to the libraries listed below
Sorting:
- WIP☆93Updated 9 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆237Updated 3 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆129Updated last month
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆173Updated 11 months ago
- ☆31Updated 3 weeks ago
- ☆286Updated last month
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆79Updated 5 months ago
- Train VAE like a boss☆279Updated 7 months ago
- ☆20Updated 7 months ago
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆161Updated 3 months ago
- My take on Flow Matching☆55Updated 4 months ago
- ☆50Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆118Updated 4 months ago
- ☆51Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆83Updated 3 months ago
- [ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule☆167Updated 2 months ago
- Focused on fast experimentation and simplicity☆73Updated 5 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆233Updated 2 months ago
- VIT inference in triton because, why not?☆28Updated last year
- Scalable and Performant Data Loading☆269Updated this week
- Clarity: A Minimalist Website Template for AI Research☆119Updated 4 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆143Updated last week
- ☆27Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆89Updated last month
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆105Updated 2 months ago
- ☆78Updated 10 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆63Updated last month
- supporting pytorch FSDP for optimizers☆79Updated 5 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 10 months ago