villinvic / Georges

Generating Evolutionary Opponents as a Reinforcement Guided Exploration Solution

☆8

Alternatives and similar repositories for Georges

Users who are interested in Georges are comparing it to the repositories listed below.

MouseHu / GEM
☆14 Updated 3 years ago
LAMDA-RL / ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆13 Updated last year
YiqinYang / VEM
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆13 Updated 3 years ago
ucla-rlcourse / competitive-rl
A set of competitive environments for Reinforcement Learning research.
☆29 Updated 2 years ago
sjtu-marl / bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆20 Updated 3 years ago
rraileanu / idaac
☆53 Updated last year
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20 Updated 2 years ago
baitingzbt / PEDA
Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023
☆32 Updated 6 months ago
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆13 Updated 4 years ago
apexrl / CoDAIL
Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>
☆18 Updated 4 years ago
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 Updated 7 months ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆38 Updated 4 years ago
keynans / HypeRL
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆24 Updated 4 years ago
ryanxhr / DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆34 Updated 2 years ago
wenzhe-li / FightLadder
Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"
☆30 Updated 11 months ago
proceduralia / high_replay_ratio_continuous_control
Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"
☆25 Updated 2 years ago
diversepsro / diverse_psro
☆18 Updated 4 years ago
hiwonjoon / IROS2021_SORS
☆11 Updated 3 years ago
ygjin11 / task-hypernet
The official implementation of the paper "Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork".
☆11 Updated last year
nigelyaoj / Quality-Similar-Diversity
Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning
☆17 Updated 2 years ago
RyanNavillus / reward-surfaces
☆17 Updated last year
pcchenxi / LAPO-offlienRL
☆15 Updated 2 years ago
guosyjlu / OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
☆14 Updated last year
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12 Updated 3 years ago
Baichenjia / PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆27 Updated 3 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆51 Updated 9 months ago
morning9393 / Optimal-Baseline-for-Multi-agent-Policy-Gradients
☆28 Updated 3 years ago
jidiai / Competition_Football
☆12 Updated 3 years ago
hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆50 Updated 2 years ago
lamda-bbo / madac
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆25 Updated 2 years ago