rbalestr-lab / stable-pretraining
Reliable, minimal and scalable library for pretraining foundation and world models
☆110 Updated last month
Alternatives and similar repositories for stable-pretraining
Users who are interested in stable-pretraining are comparing it to the libraries listed below
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs). ☆327 Updated 4 months ago
- ☆210 Updated last year
- A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C… ☆113 Updated 6 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆334 Updated last month
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons ☆106 Updated last month
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat… ☆64 Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 3 years ago
- Library that provides metrics to assess representation quality ☆20 Updated 10 months ago
- ☆229 Updated last year
- ☆122 Updated 6 months ago
- 🧱 Modula software package ☆316 Updated 4 months ago
- ☆285 Updated last year
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)… ☆287 Updated last week
- 👋 Overcomplete is a Vision-based SAE Toolbox ☆109 Updated 2 weeks ago
- Example of how to use Weights & Biases on Slurm ☆118 Updated 3 years ago
- IVON optimizer for neural networks based on variational learning. ☆75 Updated last year
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning. ☆210 Updated 2 years ago
- Parameter-Free Optimizers for Pytorch ☆130 Updated last year
- A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its vari… ☆122 Updated 2 months ago
- Research Project Template Repository ☆37 Updated 3 months ago
- Efficient optimizers ☆277 Updated last month
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling" ☆125 Updated 8 months ago
- ☆62 Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX ☆298 Updated last year
- Implementation of https://srush.github.io/annotated-s4 ☆509 Updated 6 months ago
- ☆55 Updated 11 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating... ☆382 Updated last week
- Modern Fixed Point Systems using Pytorch ☆125 Updated 2 years ago
- ☆223 Updated this week
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆188 Updated this week