lixilinx / psgd_torchLinks

Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)

☆180

Alternatives and similar repositories for psgd_torch

Users that are interested in psgd_torch are comparing it to the libraries listed below

Sorting:

modula-systems / modula
🧱 Modula software package
☆216Updated last week
cgarciae / einop
☆60Updated 3 years ago
nikhilvyas / SOAP
☆206Updated 8 months ago
bremen79 / parameterfree
Parameter-Free Optimizers for Pytorch
☆130Updated last year
kazukiosawa / asdl
ASDL: Automatic Second-order Differentiation Library for PyTorch
☆188Updated 8 months ago
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆140Updated last year
cgarciae / ciclo
A functional training loops library for JAX
☆88Updated last year
google-research / jaxpruner
☆232Updated 5 months ago
samuela / torch2jax
Run PyTorch in JAX. 🤝
☆268Updated this week
google-deepmind / jmp
JMP is a Mixed Precision library for JAX.
☆207Updated 6 months ago
mgrankin / minGPT
minGPT in JAX
☆48Updated 3 years ago
shikaiqiu / compute-better-spent
☆53Updated 10 months ago
young-geng / scalax
A simple library for scaling up JAX programs
☆140Updated 9 months ago
johnryan465 / pscan
☆40Updated last year
google-deepmind / dks
Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…
☆71Updated last month
n2cholas / jax-resnet
Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).
☆112Updated 3 years ago
GallagherCommaJack / modulax
☆17Updated 11 months ago
DarshanDeshpande / jax-models
Unofficial JAX implementations of deep learning research papers
☆156Updated 3 years ago
ludwigwinkler / JaxLightning
Running Jax in PyTorch Lightning
☆109Updated 7 months ago
glassroom / heinsen_sequence
Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)
☆94Updated 8 months ago
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆389Updated this week
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆85Updated last year
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆80Updated last year
paganpasta / eqxvision
A Python package of computer vision models for the Equinox ecosystem.
☆107Updated last year
BirkhoffG / jax-dataloader
Pytorch-like dataloaders for JAX.
☆94Updated 2 months ago
graphcore-research / unit-scaling
A library for unit scaling in PyTorch
☆128Updated 3 weeks ago
HomebrewML / HeavyBall
Efficient optimizers
☆252Updated last week
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36Updated 2 years ago
google-deepmind / nanodo
☆275Updated last year
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆184Updated 11 months ago