xhzhao / PyTorch-MPI-DDP-example
PyTorch-MPI-DDP-example
☆18 Updated 7 years ago
Alternatives and similar repositories for PyTorch-MPI-DDP-example
Users who are interested in PyTorch-MPI-DDP-example are comparing it to the libraries listed below.
- Deep learning with a multiplication budget ☆47 Updated 7 years ago
- A Chainer extension for K-FAC ☆20 Updated 6 years ago
- Butterfly matrix multiplication in PyTorch ☆177 Updated 2 years ago
- Limitations of the Empirical Fisher Approximation ☆49 Updated 9 months ago
- PyTorch AutoNEB implementation to identify minimum energy paths, e.g. in neural network loss landscapes ☆56 Updated 3 years ago
- ☆134 Updated 8 years ago
- Regularization, Neural Network Training Dynamics ☆14 Updated 5 years ago
- Block-sparse primitives for PyTorch ☆160 Updated 4 years ago
- Distributed K-FAC preconditioner for PyTorch ☆93 Updated this week
- ☆83 Updated 5 years ago
- Implement distributed machine learning with PyTorch + OpenMPI ☆52 Updated 6 years ago
- Hessian in PyTorch ☆187 Updated 5 years ago
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners. ☆149 Updated 2 years ago
- ☆77 Updated 6 years ago
- An implementation of KFAC for TensorFlow ☆199 Updated 3 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆148 Updated 2 years ago
- Structured matrices for compressing neural networks ☆67 Updated 2 years ago
- PyTorch implementation of KFAC and E-KFAC (Natural Gradient). ☆132 Updated 6 years ago
- Tensor Train decomposition on TensorFlow ☆228 Updated 4 years ago
- Experiment code for "Randomized Automatic Differentiation" ☆67 Updated 5 years ago
- Collection of algorithms for approximating the Fisher Information Matrix for Natural Gradient (and second-order methods in general) ☆142 Updated 6 years ago
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations ☆21 Updated 2 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch ☆85 Updated 4 years ago
- ☆30 Updated 4 years ago
- Convolutional Neural Tangent Kernel ☆112 Updated 6 years ago
- Hypergradient descent ☆147 Updated last year
- Autoregressive Energy Machines ☆78 Updated 3 years ago
- CoLa - Decentralized Linear Learning: https://arxiv.org/abs/1808.04883 ☆20 Updated 4 years ago
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima" ☆145 Updated 8 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆77 Updated last year