ml-research / pau
Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network
☆63Updated 3 years ago
Alternatives and similar repositories for pau:
Users that are interested in pau are comparing it to the libraries listed below
- Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors☆62Updated 4 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 3 years ago
- ☆67Updated 2 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- Code for paper "Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow"☆19Updated 4 years ago
- Pytorch implementation of the Power Spherical distribution☆74Updated 5 months ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- repo for paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE☆54Updated 3 years ago
- Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation☆68Updated 4 years ago
- ☆62Updated 11 months ago
- ☆35Updated 2 years ago
- Code from the article: "The Role of Disentanglement in Generalisation" (ICLR, 2021).☆22Updated 2 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆72Updated 5 months ago
- Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"☆79Updated 2 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- [IJCAI'19, NeurIPS'19] Anode: Unconditionally Accurate Memory-Efficient Gradients for Neural ODEs☆103Updated 4 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆39Updated 4 years ago
- ☆53Updated 5 months ago
- ☆31Updated 4 years ago
- Codebase for Learning Invariances in Neural Networks☆93Updated 2 years ago
- ☆26Updated 5 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆33Updated 4 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆86Updated 2 years ago
- Very deep VAEs in JAX/Flax☆46Updated 3 years ago
- Limitations of the Empirical Fisher Approximation☆46Updated 4 years ago
- ☆32Updated 6 years ago
- Official PyTorch BIVA implementation (BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling)☆83Updated 2 years ago
- Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"☆33Updated 2 years ago
- 🧀 Pytorch code for the Fromage optimiser.☆122Updated 6 months ago