samuela / git-re-basin

Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"

☆486

Alternatives and similar repositories for git-re-basin

Users who are interested in git-re-basin are comparing it to the libraries listed below.

kach / gradient-descent-the-ultimate-optimizer
Code for our NeurIPS 2022 paper
☆369 Updated 2 years ago
facebookresearch / dadaptation
D-Adaptation for SGD, Adam and AdaGrad
☆524 Updated 6 months ago
facebookresearch / torchdim
Named tensors with first-class dimensions for PyTorch
☆332 Updated 2 years ago
facebookresearch / FFCV-SSL
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
☆208 Updated 2 years ago
changjonathanc / minLoRA
minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.
☆470 Updated 2 years ago
xl0 / lovely-tensors
Tensors, for human consumption
☆1,271 Updated last month
archinetai / surgeon-pytorch
A library to inspect and extract intermediate layers of PyTorch models.
☆473 Updated 3 years ago
themrzmaster / git-re-basin-pytorch
Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch
☆77 Updated 2 years ago
OATML / RHO-Loss
☆208 Updated 2 years ago
google / learned_optimization
☆783 Updated 2 months ago
DarshanDeshpande / jax-models
Unofficial JAX implementations of deep learning research papers
☆156 Updated 3 years ago
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆389 Updated this week
leopard-ai / betty
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
☆343 Updated last year
brohrer / sharpened-cosine-similarity
An alternative to convolution in neural networks
☆256 Updated last year
KellerJordan / cifar10-airbench
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆275 Updated 3 weeks ago
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆519 Updated last year
f-dangel / cockpit
Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
☆480 Updated 3 years ago
jxbz / agd
Automatic gradient descent
☆208 Updated 2 years ago
BlackHC / toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
☆437 Updated 11 months ago
libffcv / ffcv-imagenet
Train ImageNet *fast* in 500 lines of code with FFCV
☆144 Updated last year
apple / ml-sigma-reparam
☆307 Updated last year
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36 Updated 2 years ago
srush / annotated-s4
Implementation of https://srush.github.io/annotated-s4
☆500 Updated last month
matthias-wright / flaxmodels
Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.
☆255 Updated 4 months ago
HomebrewML / revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
☆128 Updated 3 years ago
tysam-code / hlb-CIFAR10
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
☆1,274 Updated 7 months ago
kyegomez / Sophia
Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
☆378 Updated last year
TorchJD / torchjd
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…
☆264 Updated this week
google-deepmind / tracr
☆540 Updated last year
tmabraham / diffusion_reading_group
Diffusion Reading Group at EleutherAI
☆324 Updated 2 years ago