abhisheknaik96 / continuing-rl-exps
Code for running RL experiments on continuing (non-episodic) problems.
☆17Updated this week
Alternatives and similar repositories for continuing-rl-exps
Users that are interested in continuing-rl-exps are comparing it to the libraries listed below
Sorting:
- ☆42Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆76Updated last month
- ☆103Updated 3 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆107Updated 3 years ago
- 动手学强化学习代码☆56Updated last year
- ☆96Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆130Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆155Updated last year
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 2 months ago
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆106Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆62Updated 11 months ago
- Transformer in RL for decision-making☆97Updated 2 years ago
- ☆203Updated last year
- implementation of MADDPG using PettingZoo and PyTorch☆140Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆77Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- ☆16Updated 2 years ago
- ☆60Updated last week
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆53Updated 5 years ago
- ☆93Updated 4 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆56Updated 3 years ago