Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆54 · May 12, 2025 · Updated 9 months ago
Alternatives and similar repositories for PPO-RND
Users who are interested in PPO-RND are comparing it to the libraries listed below.
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e… ☆54 · Nov 10, 2025 · Updated 3 months ago
- Random Network Distillation pytorch ☆260 · Mar 4, 2019 · Updated 6 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 2 years ago
- Gym wrapper for pysc2 ☆10 · Sep 16, 2022 · Updated 3 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights. ☆10 · Nov 30, 2021 · Updated 4 years ago
- A2C is a special case of PPO! ☆22 · May 20, 2022 · Updated 3 years ago
- Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York ☆15 · Jun 12, 2018 · Updated 7 years ago
- A Simple Game Using Unity ML-Agents ☆10 · Nov 20, 2020 · Updated 5 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND) ☆41 · Jan 28, 2019 · Updated 7 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆160 · Apr 28, 2024 · Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- Electroplating simulation environment ☆20 · Sep 26, 2024 · Updated last year
- V-MPO torch version with DMLab30 and GTrXL ☆13 · Mar 1, 2021 · Updated 5 years ago
- Neural Network Genetic Algorithm library used for deep learning problems ☆18 · Jun 2, 2021 · Updated 4 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies ☆58 · Jan 22, 2021 · Updated 5 years ago
- A collection of RL algorithms written in JAX. ☆105 · Jul 5, 2022 · Updated 3 years ago
- Docker-based, gym-like torcs environment with vision. ☆20 · Apr 18, 2022 · Updated 3 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL) ☆18 · Apr 21, 2022 · Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization" ☆56 · Apr 2, 2023 · Updated 2 years ago
- Latent Dynamics Mixture, NeurIPS 2021 ☆18 · Oct 25, 2022 · Updated 3 years ago
- Monte Carlo tree search (MCTS) on traveling salesman problem (TSP) ☆22 · Apr 27, 2019 · Updated 6 years ago
- clear single-file JAX implementations of common RL algorithms ☆16 · Sep 5, 2021 · Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 · Sep 16, 2021 · Updated 4 years ago
- Collection of resources on plasticity loss in deep reinforcement learning ☆23 · Nov 12, 2024 · Updated last year
- CartPole-v0 via PPO with GAE, PyTorch ☆21 · Feb 10, 2019 · Updated 7 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax ☆21 · May 18, 2023 · Updated 2 years ago
- RL Implementation ☆19 · May 10, 2022 · Updated 3 years ago
- RL-Toolkit: A Research Framework for Robotics ☆21 · Jan 22, 2026 · Updated last month
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆205 · Jun 18, 2024 · Updated last year
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight. ☆240 · Nov 23, 2025 · Updated 3 months ago
- CFR implementation of a poker bot. ☆12 · Feb 17, 2023 · Updated 3 years ago
- JAX implementations of various deep reinforcement learning algorithms. ☆26 · Feb 2, 2025 · Updated last year
- Code for the paper "Exploration by Random Network Distillation" ☆930 · Oct 1, 2020 · Updated 5 years ago
- Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Ch… ☆51 · Nov 21, 2025 · Updated 3 months ago
- Random Network Distillation(RND) algo in Pytorch ☆51 · Feb 26, 2019 · Updated 7 years ago
- ☆22 · May 14, 2021 · Updated 4 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. ☆20 · May 18, 2020 · Updated 5 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/… ☆88 · Nov 22, 2017 · Updated 8 years ago