lixilinx / psgd_torchLinks
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)
β179Updated this week
Alternatives and similar repositories for psgd_torch
Users that are interested in psgd_torch are comparing it to the libraries listed below
Sorting:
- π§± Modula software packageβ225Updated last week
- ASDL: Automatic Second-order Differentiation Library for PyTorchβ189Updated 8 months ago
- β60Updated 3 years ago
- β207Updated 8 months ago
- Parameter-Free Optimizers for Pytorchβ130Updated last year
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwoβ¦β71Updated last month
- A library for unit scaling in PyTorchβ129Updated last month
- LoRA for arbitrary JAX models and functionsβ142Updated last year
- β233Updated 6 months ago
- β17Updated last year
- Accelerated First Order Parallel Associative Scanβ187Updated last year
- β40Updated last year
- β275Updated last year
- JMP is a Mixed Precision library for JAX.β208Updated 6 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabsβ36Updated 2 years ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)β94Updated 8 months ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)β62Updated 4 years ago
- β56Updated 10 months ago
- Experiment of using Tangent to autodiff tritonβ80Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvementβ¦β390Updated this week
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorchβ111Updated 2 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.β76Updated last year
- Running Jax in PyTorch Lightningβ111Updated 8 months ago
- Efficient optimizersβ256Updated 3 weeks ago
- A functional training loops library for JAXβ88Updated last year
- β43Updated 3 weeks ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offloadβ129Updated 3 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAXβ87Updated last year
- supporting pytorch FSDP for optimizersβ84Updated 8 months ago
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)β14Updated last year