rbalestr-lab / stable-pretrainingLinks
Reliable, minimal and scalable library for pretraining foundation and world models
☆76Updated this week
Alternatives and similar repositories for stable-pretraining
Users that are interested in stable-pretraining are comparing it to the libraries listed below
Sorting:
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆320Updated 3 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆105Updated 2 weeks ago
- A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C…☆110Updated 5 months ago
- ☆283Updated last year
- ☆150Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆321Updated 3 months ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆63Updated 2 years ago
- ☆120Updated 4 months ago
- ☆221Updated 11 months ago
- 🧱 Modula software package☆300Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆293Updated last year
- A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!☆86Updated this week
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆35Updated 3 years ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆104Updated 10 months ago
- Library that provides metrics to assess representation quality☆17Updated 9 months ago
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…☆274Updated this week
- Efficient optimizers☆276Updated 3 weeks ago
- Research Project Template Repository☆36Updated 2 months ago
- Repository of the journal club "Diffusion Models and Generative Modeling"☆17Updated 11 months ago
- ☆50Updated 9 months ago
- 👋 Overcomplete is a Vision-based SAE Toolbox☆98Updated this week
- Code for the paper: Rotating Features for Object Discovery☆53Updated last year
- A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its vari…☆115Updated 3 weeks ago
- ☆69Updated 2 years ago
- NF-Layers for constructing neural functionals.☆91Updated last year
- WIP☆93Updated last year
- ☆28Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆102Updated last month
- Implementation of https://srush.github.io/annotated-s4☆504Updated 4 months ago
- A curated list for awesome discrete diffusion models resources.☆488Updated 2 months ago