Danielhp95 / RegymLinks
☆12Updated 3 years ago
Alternatives and similar repositories for Regym
Users that are interested in Regym are comparing it to the libraries listed below
Sorting:
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Updated 6 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14Updated 2 years ago
- Reinforcement learning algorithm implementation☆10Updated 3 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Updated 3 years ago
- Experiments for performing empirical game-theoretic analysis of networked system control for common-pool resource management using multi-…☆18Updated 4 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆16Updated 5 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆13Updated last year
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 3 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- ☆20Updated 4 years ago
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Updated last year
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- ☆31Updated 6 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆22Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Updated 4 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Updated 6 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 4 years ago
- ☆17Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- ☆78Updated last year
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Updated 3 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆36Updated 5 years ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆39Updated last year
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆25Updated 2 years ago
- Implementation of the Option-Critic Architecture☆40Updated 6 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 6 years ago