rbalestr-lab / stable-pretraining

Reliable, minimal and scalable library for pretraining foundation and world models

☆110

Alternatives and similar repositories for stable-pretraining

Users who are interested in stable-pretraining are comparing it to the libraries listed below.

Prisma-Multimodal / ViT-Prisma
ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).
☆327 · Updated 4 months ago
BlackHC / neural_net_checklist
☆210 · Updated last year
CLAIRE-Labo / python-ml-research-template
A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C…
☆113 · Updated 6 months ago
KellerJordan / cifar10-airbench
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆334 · Updated last month
autonomousvision / akorn
[ICLR'25] Artificial Kuramoto Oscillatory Neurons
☆106 · Updated last month
lucmos / relreps
Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…
☆64 · Updated 2 years ago
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36 · Updated 3 years ago
BARL-SSL / reptrix
Library that provides metrics to assess representation quality
☆20 · Updated 10 months ago
nikhilvyas / SOAP
☆229 · Updated last year
kvfrans / splus
☆122 · Updated 6 months ago
modula-systems / modula
🧱 Modula software package
☆316 · Updated 4 months ago
google-deepmind / nanodo
☆285 · Updated last year
TorchJD / torchjd
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…
☆287 · Updated last week
KempnerInstitute / overcomplete
👋 Overcomplete is a Vision-based SAE Toolbox
☆109 · Updated 2 weeks ago
elyall / wandb_on_slurm
Example of how to use Weights & Biases on Slurm
☆118 · Updated 3 years ago
team-approx-bayes / ivon
IVON optimizer for neural networks based on variational learning.
☆75 · Updated last year
facebookresearch / FFCV-SSL
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
☆210 · Updated 2 years ago
bremen79 / parameterfree
Parameter-Free Optimizers for Pytorch
☆130 · Updated last year
willisma / diffuse_nnx
A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its vari…
☆122 · Updated 2 months ago
mila-iqia / ResearchTemplate
Research Project Template Repository
☆37 · Updated 3 months ago
HomebrewML / HeavyBall
Efficient optimizers
☆277 · Updated last month
facebookresearch / capi
Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"
☆125 · Updated 8 months ago
shikaiqiu / compute-better-spent
☆62 · Updated last year
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆298 · Updated last year
srush / annotated-s4
Implementation of https://srush.github.io/annotated-s4
☆509 · Updated 6 months ago
bartbussmann / matryoshka_sae
☆55 · Updated 11 months ago
MinghuiChen43 / awesome-deep-phenomena
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
☆382 · Updated last week
locuslab / torchdeq
Modern Fixed Point Systems using Pytorch
☆125 · Updated 2 years ago
coallaoh / Principles
☆223 · Updated this week
lixilinx / psgd_torch
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆188 · Updated this week