rbalestr-lab / stable-pretrainingLinks
Reliable, minimal and scalable library for pretraining foundation and world models
☆63Updated this week
Alternatives and similar repositories for stable-pretraining
Users that are interested in stable-pretraining are comparing it to the libraries listed below
Sorting:
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆309Updated 2 months ago
- ☆120Updated 3 months ago
- ☆281Updated last year
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆102Updated last month
- ☆150Updated last year
- 🧱 Modula software package☆239Updated last month
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆301Updated 2 months ago
- Modern Fixed Point Systems using Pytorch☆115Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆291Updated last year
- A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C…☆106Updated 3 months ago
- ☆214Updated 9 months ago
- Efficient optimizers☆261Updated this week
- WIP☆93Updated last year
- Research Project Template Repository☆34Updated last month
- supporting pytorch FSDP for optimizers☆84Updated 9 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆22Updated 3 weeks ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆103Updated 8 months ago
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…☆270Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆164Updated 3 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 6 months ago
- Flow-matching algorithms in JAX☆104Updated last year
- A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!☆85Updated 3 weeks ago
- Implementation of https://srush.github.io/annotated-s4☆502Updated 3 months ago
- Open-source framework for the research and development of foundation models.☆452Updated this week
- ☆58Updated 11 months ago
- Annotated version of the Mamba paper☆490Updated last year
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆120Updated 5 months ago
- Implementation of PSGD optimizer in JAX☆34Updated 8 months ago
- ☆27Updated last year