facebookresearch / capiLinks
Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"
☆110Updated 2 months ago
Alternatives and similar repositories for capi
Users that are interested in capi are comparing it to the libraries listed below
Sorting:
- WIP☆93Updated 10 months ago
- Train VAE like a boss☆281Updated 8 months ago
- Focused on fast experimentation and simplicity☆74Updated 6 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆251Updated 3 months ago
- ☆32Updated last month
- ☆64Updated 2 months ago
- ☆50Updated last year
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆173Updated last year
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆145Updated last month
- ☆20Updated 8 months ago
- [ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule☆173Updated 3 months ago
- ☆286Updated 2 months ago
- ☆78Updated 11 months ago
- supporting pytorch FSDP for optimizers☆82Updated 6 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆81Updated 6 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- When it comes to optimizers, it's always better to be safe than sorry☆241Updated 2 months ago
- My take on Flow Matching☆63Updated 5 months ago
- ☆27Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆91Updated 2 months ago
- Implementations of attention with the softpick function, naive and FlashAttention-2☆77Updated last month
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆138Updated 4 months ago
- ☆74Updated 8 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆64Updated 2 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆140Updated last month
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆125Updated 4 months ago
- Clarity: A Minimalist Website Template for AI Research☆124Updated 5 months ago
- ☆208Updated 2 weeks ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆284Updated 3 weeks ago
- ☆51Updated last year