facebookresearch / capi
Code and weights for the paper "Cluster and Predict Latent Patches for Improved Masked Image Modeling"
☆125 Updated 3 weeks ago
Alternatives and similar repositories for capi
Users who are interested in capi are comparing it to the libraries listed below.
- WIP ☆93 Updated last year
- TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness ☆112 Updated 9 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation" ☆183 Updated last year
- ☆34 Updated 8 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic… ☆90 Updated last year
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. ☆138 Updated 4 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL). ☆194 Updated 8 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆344 Updated 2 months ago
- ☆304 Updated 8 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆281 Updated 7 months ago
- ☆827 Updated last month
- Train VAE like a boss ☆311 Updated last year
- Implementation of a multimodal diffusion transformer in PyTorch ☆107 Updated last year
- [NeurIPS '25 Spotlight] Official PyTorch implementation of "Vision Transformers Don't Need Trained Registers" ☆161 Updated 3 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆82 Updated 7 months ago
- My take on Flow Matching ☆90 Updated last year
- Focused on fast experimentation and simplicity ☆79 Updated last year
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training" ☆195 Updated 7 months ago
- Python library to evaluate VLM models' robustness across diverse benchmarks ☆220 Updated 2 months ago
- Flash Attention Triton kernel with support for second-order derivatives ☆129 Updated 3 weeks ago
- Clarity: A Minimalist Website Template for AI Research ☆175 Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆97 Updated 5 months ago
- ☆27 Updated 3 months ago
- ConceptAttention: A method for interpreting multi-modal diffusion transformers. ☆410 Updated last week
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers" ☆170 Updated 11 months ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆118 Updated last week
- PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning ☆231 Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆163 Updated 11 months ago
- Synthetic Alphabet Dataset ☆19 Updated 9 months ago