jxbz / madamLinks
👩 Pytorch and Jax code for the Madam optimiser.
☆51Updated 4 years ago
Alternatives and similar repositories for madam
Users that are interested in madam are comparing it to the libraries listed below
Sorting:
- 🧀 Pytorch code for the Fromage optimiser.☆124Updated 11 months ago
- ☆153Updated 5 years ago
- Very deep VAEs in JAX/Flax☆46Updated 4 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network☆64Updated 4 years ago
- Experiment code for "Randomized Automatic Differentiation"☆67Updated 4 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- A selection of neural network models ported from torchvision for JAX & Flax.☆44Updated 4 years ago
- Loss Patterns of Neural Networks☆85Updated 3 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆59Updated 5 years ago
- Structured matrices for compressing neural networks☆67Updated last year
- ☆99Updated 3 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 3 years ago
- Codebase for Learning Invariances in Neural Networks☆95Updated 2 years ago
- Autoregressive Energy Machines☆78Updated 2 years ago
- ☆45Updated 5 years ago
- ☆68Updated last year
- Normalizing Flows in Jax☆108Updated 4 years ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- A discrete sequential VAE☆40Updated 5 years ago
- Toy implementations of some popular ML optimizers using Python/JAX☆44Updated 4 years ago
- ☆13Updated 4 years ago
- ☆26Updated 6 years ago
- ☆78Updated 5 years ago
- ☆32Updated 6 years ago
- A lightweight library for tensorflow 2.0☆66Updated 5 years ago
- Code for: Implicit Competitive Regularization in GANs☆114Updated 3 years ago
- Pretrained TorchVision models on CIFAR10 dataset (with weights)☆24Updated 4 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning☆33Updated 5 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago