jxbz / madam
👩 Pytorch and Jax code for the Madam optimiser.
☆51Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for madam
- 🧀 Pytorch code for the Fromage optimiser.☆122Updated 4 months ago
- Experiment code for "Randomized Automatic Differentiation"☆67Updated 4 years ago
- Very deep VAEs in JAX/Flax☆45Updated 3 years ago
- ☆45Updated 5 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning☆33Updated 4 years ago
- Pretrained TorchVision models on CIFAR10 dataset (with weights)☆24Updated 4 years ago
- The Singular Values of Convolutional Layers☆71Updated 6 years ago
- Code for: Implicit Competitive Regularization in GANs☆114Updated 2 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago
- A discrete sequential VAE☆38Updated 4 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (affine group preconditioner, low-rank approximation preconditioner …☆128Updated last month
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆104Updated 2 years ago
- A selection of neural network models ported from torchvision for JAX & Flax.☆44Updated 3 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆39Updated 4 years ago
- Autoregressive Energy Machines☆77Updated 2 years ago
- ☆155Updated 4 years ago
- Structured matrices for compressing neural networks☆67Updated last year
- Jupyter Notebook corresponding to 'Going with the Flow: An Introduction to Normalizing Flows'☆25Updated 3 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network☆64Updated 3 years ago
- ☆42Updated 4 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks☆84Updated 2 years ago
- A lightweight library for tensorflow 2.0☆66Updated 4 years ago
- Limitations of the Empirical Fisher Approximation☆45Updated 4 years ago
- Loss Patterns of Neural Networks☆82Updated 3 years ago
- ☆11Updated 4 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆77Updated 4 years ago
- ☆32Updated 6 years ago
- Normalizing Flows in Jax☆105Updated 4 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆21Updated 4 years ago