rbalestr-lab / stable-pretrainingLinks
Reliable, minimal and scalable library for pretraining foundation and world models
☆68Updated this week
Alternatives and similar repositories for stable-pretraining
Users that are interested in stable-pretraining are comparing it to the libraries listed below
Sorting:
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆103Updated 2 months ago
- A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C…☆109Updated 4 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆311Updated 2 months ago
- ☆150Updated last year
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…☆271Updated this week
- ☆283Updated last year
- ☆120Updated 4 months ago
- 🧱 Modula software package☆287Updated 2 months ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆63Updated 2 years ago
- Implementation of Diffusion Transformer (DiT) in JAX☆293Updated last year
- ☆217Updated 10 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆313Updated 3 months ago
- Research Project Template Repository☆34Updated last month
- Example of how to use Weights & Biases on Slurm☆118Updated 3 years ago
- Efficient optimizers☆269Updated last week
- WIP☆93Updated last year
- NF-Layers for constructing neural functionals.☆90Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNS☆23Updated last week
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆124Updated last year
- Comparison between GFlowNets & Maximum Entropy RL☆19Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆166Updated 3 months ago
- Code for the paper: Rotating Features for Object Discovery☆53Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 2 weeks ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆35Updated 3 years ago
- Implementation of https://srush.github.io/annotated-s4☆503Updated 3 months ago
- Flow-matching algorithms in JAX☆105Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆122Updated 6 months ago
- Library that provides metrics to assess representation quality☆16Updated 8 months ago
- Code for our NeurIPS 2022 paper☆369Updated 2 years ago
- Supporting code for the blog post on modular manifolds.☆77Updated 3 weeks ago