ericjang / maml-jax
Implementation of Model-Agnostic Meta-Learning (MAML) in JAX
☆189 Updated 2 years ago
Alternatives and similar repositories for maml-jax:
Users who are interested in maml-jax are comparing it to the libraries listed below.
- Velocity in deep-learning research ☆277 Updated 2 years ago
- Autoregressive Energy Machines ☆77 Updated 2 years ago
- Basic experiment framework for tensorflow. ☆92 Updated 3 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms. ☆375 Updated 2 years ago
- Guided Evolutionary Strategies ☆270 Updated 2 years ago
- Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses ☆185 Updated last year
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games" ☆154 Updated 6 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX ☆59 Updated 5 years ago
- ☆182 Updated 9 months ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation ☆53 Updated 5 years ago
- 2019 talk at GECCO ☆68 Updated 5 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆93 Updated 4 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch. ☆102 Updated 5 years ago
- Normalizing Flows in Jax ☆107 Updated 4 years ago
- Advanced Differentiable Neural Computer (ADNC) with application to bAbI task and CNN RC task. ☆94 Updated 2 years ago
- ☆133 Updated 7 years ago
- Full World Models Implementation in Chainer ☆165 Updated 6 years ago
- ☆43 Updated 5 years ago
- Augmented environments with RL ☆103 Updated 6 years ago
- Implementation of Spectral Inference Networks, ICLR 2019 ☆171 Updated 5 years ago
- Some hard problems for reinforcement learning. ☆32 Updated 6 years ago
- A reinforcement learning framework ☆155 Updated 6 years ago
- learning to search in pytorch ☆110 Updated 5 years ago
- The Differentiable Cross-Entropy Method ☆126 Updated 4 years ago
- A Python implementation of the gradient REBAR estimator. ☆46 Updated 6 years ago
- ☆115 Updated last year
- Highly Modular and Scalable Reinforcement Learning ☆114 Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.