karush17 / Deep-Eligibility-Traces
Implementation of Eligibility Traces with Neural Networks in PyTorch and Tensorflow 2.0
☆25Updated 3 years ago
Alternatives and similar repositories for Deep-Eligibility-Traces:
Users that are interested in Deep-Eligibility-Traces are comparing it to the libraries listed below
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Updated last year
- A tool for aggregating and plotting MARL experiment data.☆76Updated 2 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆101Updated 2 years ago
- Gridworld domains in the gym interface☆27Updated 6 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆30Updated 4 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆96Updated 5 months ago
- The Starcraft Multi-Agent challenge lite☆42Updated 6 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Repo for the multi-agent PressurePlate environment☆16Updated 3 years ago
- A collection of RL algorithms written in JAX.☆96Updated 2 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆41Updated 8 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆76Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆138Updated 11 months ago
- clear single-file JAX implementations of common RL algorithms☆17Updated 3 years ago
- Simple gym environments for safety in Reinforcement Learning Research☆15Updated 8 months ago
- Deep Learning Project☆21Updated 5 years ago
- ☆31Updated 5 years ago
- Gridworld for MARL experiments☆139Updated 4 years ago
- ☆53Updated last year
- Baselines for gymnax 🤖☆66Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆41Updated last year
- Simple maze environments using mujoco-py☆54Updated last year
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆17Updated last month
- ☆27Updated 3 weeks ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆60Updated 3 years ago
- ☆41Updated last year
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆56Updated 4 years ago