jxbz / fromage
🧀 Pytorch code for the Fromage optimiser.
☆124Updated 9 months ago
Alternatives and similar repositories for fromage:
Users that are interested in fromage are comparing it to the libraries listed below
- A library for evaluating representations.☆76Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆180Updated 3 years ago
- Very deep VAEs in JAX/Flax☆46Updated 3 years ago
- 👩 Pytorch and Jax code for the Madam optimiser.☆51Updated 4 years ago
- ☆99Updated 3 years ago
- ☆77Updated 5 years ago
- ☆153Updated 4 years ago
- Codebase for Learning Invariances in Neural Networks☆95Updated 2 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆78Updated 4 years ago
- Experiment code for "Randomized Automatic Differentiation"☆67Updated 4 years ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆106Updated last year
- Memory efficient MAML using gradient checkpointing☆84Updated 5 years ago
- Code for: Implicit Competitive Regularization in GANs☆114Updated 3 years ago
- Official code for the Stochastic Polyak step-size optimizer☆139Updated 10 months ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks☆83Updated 3 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆59Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆145Updated last year
- Autoregressive Energy Machines☆77Updated 2 years ago
- This repository is no longer maintained. Check☆81Updated 5 years ago
- Python implementation of GLN in different frameworks☆98Updated 4 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆109Updated 2 years ago
- ☆67Updated last year
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- A Machine Learning workflow for Slurm.☆149Updated 4 years ago
- Loss Patterns of Neural Networks☆84Updated 3 years ago
- Code from the article: "The Role of Disentanglement in Generalisation" (ICLR, 2021).☆22Updated 2 years ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- Official PyTorch BIVA implementation (BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling)☆84Updated 2 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆209Updated last year