Distributed K-FAC preconditioner for PyTorch
☆95Apr 15, 2026Updated this week
Alternatives and similar repositories for kfac-pytorch
Users that are interested in kfac-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BERT for Distributed PyTorch + AMP Training☆12Mar 15, 2023Updated 3 years ago
- Pytorch implementation of KFAC - this is a port of https://github.com/tensorflow/kfac/☆31Jun 6, 2024Updated last year
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆286Feb 27, 2023Updated 3 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆153Jun 22, 2023Updated 2 years ago
- ☆135Oct 23, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆221Mar 31, 2026Updated 2 weeks ago
- Regularization, Neural Network Training Dynamics☆14Jan 13, 2020Updated 6 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆149Oct 1, 2023Updated 2 years ago
- ☆33Jul 8, 2024Updated last year
- Randomized algorithm class at CU☆15Jul 8, 2025Updated 9 months ago
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks☆779Jul 10, 2025Updated 9 months ago
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19May 9, 2019Updated 6 years ago
- ☆30Feb 11, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Sep 14, 2021Updated 4 years ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆24Nov 4, 2024Updated last year
- code for experiments in Grosse and Salakhutdinov, 2015.☆12Oct 9, 2016Updated 9 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 5 years ago
- ☆10Apr 29, 2023Updated 2 years ago
- a tool to generate skeleton applications that mimic a real applications' parallel or distributed performance at a task level☆13Jan 11, 2017Updated 9 years ago
- A decentralised application that creates high quality machine learning datasets☆13Jan 22, 2019Updated 7 years ago
- In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results w…☆16Oct 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆192Dec 5, 2024Updated last year
- Kira is an astronomy image processing toolkit implemented with Apache Spark.☆15Feb 9, 2016Updated 10 years ago
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆23Updated this week
- Look for arXiv papers in a Zotero library and find available DOIs of published versions.☆14Apr 11, 2024Updated 2 years ago
- Computing gradients and Hessians of feed-forward networks with GPU acceleration☆20Feb 14, 2024Updated 2 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆143May 26, 2019Updated 6 years ago
- The code for the NeurIPS19 paper and blog on "Uniform convergence may be unable to explain generalization in deep learning".☆10Oct 26, 2019Updated 6 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Jul 17, 2021Updated 4 years ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆33Dec 3, 2019Updated 6 years ago
- A LARS implementation in PyTorch☆353Feb 21, 2020Updated 6 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆194Apr 3, 2026Updated 2 weeks ago
- ☆13Feb 24, 2020Updated 6 years ago
- ☆15Feb 12, 2021Updated 5 years ago
- An efficient implementation of GPNN☆17Nov 24, 2022Updated 3 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago