AndreasMadsen / stable-nalu
Code for Neural Arithmetic Units (ICLR) and Measuring Arithmetic Extrapolation Performance (SEDL|NeurIPS)
☆147 Updated 3 years ago
Alternatives and similar repositories for stable-nalu:
Users who are interested in stable-nalu are comparing it to the libraries listed below.
- learning to search in pytorch ☆110 Updated 5 years ago
- ☆103 Updated 4 years ago
- ☆61 Updated 2 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax ☆189 Updated 2 years ago
- Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses ☆185 Updated last year
- Pip-installable differentiable stacks in PyTorch! ☆65 Updated 4 years ago
- ☆153 Updated 4 years ago
- Loss Patterns of Neural Networks ☆84 Updated 3 years ago
- ☆83 Updated 4 years ago
- Neural Turing Machines in pytorch ☆48 Updated 3 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch. ☆102 Updated 5 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX ☆59 Updated 5 years ago
- ☆64 Updated 5 years ago
- Autoregressive Energy Machines ☆77 Updated 2 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games" ☆154 Updated 6 years ago
- 🧀 Pytorch code for the Fromage optimiser. ☆124 Updated 9 months ago
- Training Transformer-XL on 128 GPUs ☆140 Updated 4 years ago
- Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot expe… ☆148 Updated 2 years ago
- ☆182 Updated 9 months ago
- ICLR Reproducibility Challenge 2019 ☆219 Updated 5 years ago
- ☆48 Updated 5 years ago
- NYU PSYCH-GA 3405.001 / DS-GA 3001.014 : Advancing AI through cognitive science ☆131 Updated 6 years ago
- Configure Python functions explicitly and safely ☆126 Updated 5 months ago
- Graduate topics course on learning discrete latent structure. ☆67 Updated 6 years ago
- Equi-normalization of Neural Networks ☆115 Updated 5 years ago
- Code for: Implicit Competitive Regularization in GANs ☆114 Updated 3 years ago
- ☆219 Updated 6 years ago
- Comparing Fixed and Adaptive Computation Time for Recurrent Neural Networks ☆35 Updated 7 years ago
- Code for the Eager Translation Model from the paper You May Not Need Attention ☆294 Updated 6 years ago
- This repository is no longer maintained. Check ☆81 Updated 5 years ago