gpauloski/kfac-pytorch

Distributed K-FAC preconditioner for PyTorch

☆95

Alternatives and similar repositories for kfac-pytorch

Users who are interested in kfac-pytorch are comparing it to the libraries listed below.

alecwangcq / KFAC-Pytorch
Pytorch implementation of KFAC and E-KFAC (Natural Gradient).
☆133 · Jul 2, 2019 · Updated 6 years ago
amirgholami / adahessian
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
☆283 · Feb 27, 2023 · Updated 3 years ago
tyohei / chainerkfac
A Chainer extension for K-FAC
☆20 · Jun 16, 2019 · Updated 6 years ago
f-dangel / backpack
BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.
☆606 · Nov 28, 2025 · Updated 3 months ago
fL0n9 / SKFAC-MindSpore
SKFAC Preconditioner for MindSpore
☆12 · Jul 2, 2021 · Updated 4 years ago
Thrandis / EKFAC-pytorch
Repository containing Pytorch code for EKFAC and K-FAC preconditioners.
☆153 · Jun 22, 2023 · Updated 2 years ago
tfjgeorge / nngeometry
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
☆221 · Feb 9, 2026 · Updated 3 weeks ago
cybertronai / pytorch-sso
PyTorch-SSO: Scalable Second-Order methods in PyTorch
☆148 · Oct 1, 2023 · Updated 2 years ago
google-deepmind / kfac-jax
Second Order Optimization and Curvature Estimation with K-FAC in JAX.
☆314 · Feb 25, 2026 · Updated last week
amirgholami / PyHessian
PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks
☆777 · Jul 10, 2025 · Updated 7 months ago
tensorflow / kfac
An implementation of KFAC for TensorFlow
☆199 · Feb 11, 2022 · Updated 4 years ago
gd-zhang / Weight-Decay
Regularization, Neural Network Training Dynamics
☆14 · Jan 13, 2020 · Updated 6 years ago
KordingLab / hilbert-constrained-gradient-descent
Pytorch optimizers implementing Hilbert Constrained Gradient Descent
☆19 · May 9, 2019 · Updated 6 years ago
IST-DASLab / M-FAC
Efficient reference implementations of the static & dynamic M-FAC algorithms (for pruning and optimization)
☆17 · Feb 23, 2022 · Updated 4 years ago
Kid-key / MimicNorm
☆19 · Jan 27, 2021 · Updated 5 years ago
elizabethnewman / hessQuik
Computing gradients and Hessians of feed-forward networks with GPU acceleration
☆20 · Feb 14, 2024 · Updated 2 years ago
stephenbeckr / randomized-algorithm-class
Randomized algorithm class at CU
☆15 · Jul 8, 2025 · Updated 7 months ago
kunimi00 / ContrastiveSSLMusicAudio
☆13 · Jun 2, 2022 · Updated 3 years ago
r-three / mats
☆33 · Jul 8, 2024 · Updated last year
foxlf823 / DilatedRnn
A PyTorch implementation of Dilated RNN
☆11 · Dec 31, 2017 · Updated 8 years ago
HKBU-HPML / OMGS-SGD
Layer-wise Sparsification of Distributed Deep Learning
☆10 · Jul 6, 2020 · Updated 5 years ago
apple / ml-tic-lm
Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025
☆21 · Feb 27, 2026 · Updated last week
hydra-hoard / hydra
A decentralised application that creates high quality machine learning datasets
☆13 · Jan 22, 2019 · Updated 7 years ago
SirRob1997 / Crowded-Valley---Results
This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184 · Jul 17, 2021 · Updated 4 years ago
kazukiosawa / pipe-fisher
☆10 · Apr 29, 2023 · Updated 2 years ago
Kaffaljidhmah2 / SpecDec_pp
Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
☆15 · Jul 10, 2025 · Updated 7 months ago
LongLong-Jing / awesome-unsupervised-learning
Awesome unsupervised learning paper list
☆12 · Jan 4, 2018 · Updated 8 years ago
lzhangbv / eva
[ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation
☆12 · Jul 31, 2023 · Updated 2 years ago
locuslab / uniform-convergence-NeurIPS19
The code for the NeurIPS19 paper and blog on "Uniform convergence may be unable to explain generalization in deep learning".
☆10 · Oct 26, 2019 · Updated 6 years ago
kimihe / Falcon
A computation-parallel deep learning architecture.
☆13 · Sep 25, 2019 · Updated 6 years ago
Chavdarova / LAGAN-Lookahead_Minimax
Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.
☆15 · Mar 28, 2021 · Updated 4 years ago
Lifelong-ML / LASEM
Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"
☆12 · Aug 17, 2021 · Updated 4 years ago
rwightman / py-lmdb
Universal Python binding for the LMDB 'Lightning' Database
☆13 · Nov 7, 2017 · Updated 8 years ago
YukeWang96 / DSXplore_IPDPS21
Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.
☆13 · Apr 6, 2021 · Updated 4 years ago
modestyachts / cifar-10.2
Host CIFAR-10.2 Data Set
☆13 · Sep 22, 2021 · Updated 4 years ago
ParCIS / Ok-Topk
Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…
☆27 · Dec 10, 2022 · Updated 3 years ago
frank-xwang / CLD-UnsupervisedLearning
[CVPR 2021] Code release for "Unsupervised Feature Learning by Cross-Level Instance-Group Discrimination."
☆101 · May 8, 2022 · Updated 3 years ago
lzhangbv / dear_pytorch
[ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining
☆12 · Dec 4, 2023 · Updated 2 years ago
2prime / OpenBlackBox
☆12 · Nov 5, 2019 · Updated 6 years ago