nnaisense / pgpelib
A mini library for Policy Gradients with Parameter-based Exploration, with a reference implementation of the ClipUp optimizer (https://arxiv.org/abs/2008.02387) from NNAISENSE.
☆73 Dec 10, 2020 Updated 5 years ago
Alternatives and similar repositories for pgpelib
Users interested in pgpelib are comparing it to the libraries listed below.
- Clockwork VAEs in JAX/Flax ☆32 Jul 16, 2021 Updated 4 years ago
- ☆28 Jan 11, 2021 Updated 5 years ago
- Variational Reinforcement Learning ☆17 Jul 25, 2024 Updated last year
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors" ☆46 Nov 22, 2022 Updated 3 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs" ☆16 Apr 27, 2020 Updated 5 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆19 Apr 3, 2018 Updated 7 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization ☆35 Oct 22, 2020 Updated 5 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019 ☆81 Jul 23, 2019 Updated 6 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆28 Dec 7, 2021 Updated 4 years ago
- Library to compare and evaluate reward functions ☆67 Oct 23, 2023 Updated 2 years ago
- Benchmark environments for reward modelling and imitation learning algorithms. ☆46 Sep 19, 2023 Updated 2 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES" ☆12 Mar 24, 2023 Updated 2 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces ☆11 Nov 4, 2021 Updated 4 years ago
- ☆20 Nov 19, 2025 Updated 2 months ago
- PyTorch implementation of the AAMAS 2021 paper "Energy-Based Imitation Learning" ☆11 Oct 8, 2021 Updated 4 years ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain ☆11 Jul 14, 2020 Updated 5 years ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020) ☆78 Dec 5, 2023 Updated 2 years ago
- The implementation of "The Kanerva Machine" with PyTorch and Pyro ☆12 Jun 14, 2018 Updated 7 years ago
- This repository contains data and analysis scripts to reproduce the figures as well as source code and simulation scripts to perform the … ☆13 Apr 13, 2021 Updated 4 years ago
- SynPick dataset generator ☆13 Jul 8, 2021 Updated 4 years ago
- MuJoCo models for Unitree Robots ☆12 Nov 24, 2021 Updated 4 years ago
- Short-time Fourier transform (STFT) for JAX ☆15 Dec 20, 2021 Updated 4 years ago
- ☆14 Jun 26, 2019 Updated 6 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 Jun 30, 2020 Updated 5 years ago
- Evolution Strategy Library ☆56 Jun 11, 2020 Updated 5 years ago
- Predicting personality from the resting state EEG ☆15 Dec 5, 2014 Updated 11 years ago
- Official PyTorch implementation of the paper: "Locally Shifted Attention With Early Global Integration" ☆15 Dec 20, 2021 Updated 4 years ago
- Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance" ☆16 Jul 31, 2025 Updated 6 months ago
- This package provides a simple Python implementation of Invariant Causal Prediction (ICP) ☆13 Mar 22, 2024 Updated last year
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was … ☆14 Jul 15, 2022 Updated 3 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020 ☆56 Apr 27, 2020 Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 Aug 4, 2020 Updated 5 years ago
- Code for Canonical 3D Deformable Mapping (C3DM) paper ☆19 Sep 19, 2021 Updated 4 years ago
- Benchmarking TD3 and DDPG on PyBullet ☆55 Jun 19, 2019 Updated 6 years ago
- Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy ☆17 Apr 2, 2020 Updated 5 years ago
- A deep recurrent model for exchangeable data ☆34 Jul 1, 2020 Updated 5 years ago
- Formulating Model-based RL Dynamics as a continuous rather than one step prediction ☆36 Aug 24, 2022 Updated 3 years ago
- EDGE: Scalable and optimum mutual information estimator for high-dimensional applications including deep learning ☆39 May 27, 2022 Updated 3 years ago
- Matplotlib Backend for Altair Visualization Library ☆16 Dec 2, 2020 Updated 5 years ago