A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"
☆153Dec 6, 2018Updated 7 years ago
Alternatives and similar repositories for symplectic-gradient-adjustment
Users that are interested in symplectic-gradient-adjustment are comparing it to the libraries listed below
Sorting:
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27May 13, 2019Updated 6 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- Prototypes of differentiable differential equation solvers in JAX.☆27Feb 3, 2020Updated 6 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Guided Evolutionary Strategies☆273Apr 6, 2023Updated 2 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- ☆47Jun 19, 2018Updated 7 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- Limitations of the Empirical Fisher Approximation☆49Mar 3, 2025Updated last year
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,019Mar 13, 2019Updated 6 years ago
- A Framework for Equilibrium Learning in Sealed-Bid Auctions☆24Mar 17, 2023Updated 2 years ago
- Inference on non-linear dynamical systems written in JAX☆11Aug 20, 2020Updated 5 years ago
- Implementation of Spectral Inference Networks, ICLR 2019☆171May 23, 2019Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Aug 21, 2018Updated 7 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆153Jun 22, 2023Updated 2 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 9 years ago
- This is a joint implementation of AdaShift optimizer, LGANs, and MaxGP.☆14Oct 7, 2020Updated 5 years ago
- Unsupervised Data Generated for GeoQuery and SAIL Datasets☆46Nov 5, 2016Updated 9 years ago
- bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent☆1,529Apr 13, 2024Updated last year
- Code for "Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer"☆244Aug 6, 2018Updated 7 years ago
- Progressive matrices dataset, as described in: Measuring abstract reasoning in neural networks (Barrett*, Hill*, Santoro*, Morcos, Lillic…☆182Feb 1, 2019Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"☆529Nov 22, 2018Updated 7 years ago
- C++ implementation of Proximal Policy Optimization☆89Sep 1, 2022Updated 3 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Jun 28, 2019Updated 6 years ago
- Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934☆113Mar 3, 2020Updated 6 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆15Jan 18, 2019Updated 7 years ago
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Apr 19, 2018Updated 7 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 6 years ago
- Code for "Inference Suboptimality in Variational Autoencoders"☆14Mar 17, 2020Updated 5 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Mar 18, 2019Updated 6 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆374Oct 15, 2021Updated 4 years ago
- TensorFlow Reinforcement Learning☆3,135Dec 8, 2022Updated 3 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆225Mar 29, 2017Updated 8 years ago
- A PyTorch implementation of Conditional PixelCNNs☆27Jan 24, 2018Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural P…☆1,015Jan 19, 2021Updated 5 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Jan 12, 2019Updated 7 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆133Jul 2, 2019Updated 6 years ago