ml-research / pau
Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Networks
☆63 · Updated 4 years ago
Alternatives and similar repositories for pau
Users that are interested in pau are comparing it to the libraries listed below
- ☆64 · Updated last year
- Codebase for Learning Invariances in Neural Networks ☆96 · Updated 3 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆76 · Updated last year
- Monotone operator equilibrium networks ☆53 · Updated 5 years ago
- 🧀 Pytorch code for the Fromage optimiser. ☆129 · Updated last year
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021) ☆88 · Updated 2 years ago
- ☆54 · Updated last year
- [IJCAI'19, NeurIPS'19] Anode: Unconditionally Accurate Memory-Efficient Gradients for Neural ODEs ☆106 · Updated 4 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube ☆47 · Updated 6 years ago
- Efficient Householder Transformation in PyTorch ☆66 · Updated 4 years ago
- ☆68 · Updated 2 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088 ☆54 · Updated 6 years ago
- Implements stochastic line search ☆118 · Updated 2 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization ☆57 · Updated 4 years ago
- ☆47 · Updated 6 years ago
- 👩 Pytorch and Jax code for the Madam optimiser. ☆52 · Updated 4 years ago
- Official implementation of the paper "Topographic VAEs learn Equivariant Capsules" ☆80 · Updated 3 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆208 · Updated last year
- Pytorch implementation of the Power Spherical distribution ☆74 · Updated last year
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC". ☆60 · Updated 6 years ago
- Hypergradient descent ☆149 · Updated last year
- Autoregressive Energy Machines ☆78 · Updated 2 years ago
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks ☆84 · Updated 3 years ago
- Repo for the paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE ☆56 · Updated 4 years ago
- ☆28 · Updated 3 years ago
- Optimization with orthogonal constraints and on general manifolds ☆130 · Updated 5 years ago
- This repository is no longer maintained. Check ☆81 · Updated 5 years ago
- Structured matrices for compressing neural networks ☆67 · Updated 2 years ago
- Very deep VAEs in JAX/Flax ☆46 · Updated 4 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆35 · Updated 5 years ago