XinJingHao / Actor-Sharer-Learner
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆19Updated 3 weeks ago
Alternatives and similar repositories for Actor-Sharer-Learner:
Users who are interested in Actor-Sharer-Learner compare it to the libraries listed below.
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆44Updated 10 months ago
- ☆21Updated 8 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- ☆21Updated 9 months ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 2 years ago
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated 9 months ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆32Updated 7 months ago
- Modular Single-file Reinforcement Learning Algorithms Library☆37Updated last year
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆21Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated last year
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆12Updated 8 months ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆20Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 9 months ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆14Updated last year
- DecentralizedLearning☆22Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- V-MPO torch version with DMLab30 and GTrXL☆12Updated 3 years ago
- OpenAI Gym environment wrapper for vectorizing environments with Ray☆22Updated last year
- Implementation of the NeurIPS 2021 paper "On Effective Scheduling of Model-based Reinforcement Learning"☆12Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- ☆19Updated 7 months ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated 2 years ago