villinvic / Georges
Generating Evolutionary Opponents as a Reinforcement Guided Exploration Solution
☆8Updated 4 months ago
Related projects
Alternatives and complementary repositories for Georges
- V-MPO torch version with DMLab30 and GTrXL☆12Updated 3 years ago
- Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆12Updated 2 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆13Updated 9 months ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆14Updated 2 weeks ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆30Updated last year
- curriculum☆20Updated last year
- code for ROMANCE☆12Updated last month
- Code accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆29Updated 4 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Updated 2 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆20Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆25Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆23Updated last year
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆22Updated 3 weeks ago
- Scalable Multi-Agent Reinforcement Learning☆9Updated 2 years ago
- Deep Learning Project☆20Updated 4 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆19Updated last year
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13Updated 6 months ago
- A set of competitive environments for Reinforcement Learning research.☆28Updated last year
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆19Updated 2 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer architectures.☆13Updated last month
- PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…☆37Updated last year
- A variant of VariBAD that is robust to difficult tasks☆9Updated last year