google-deepmind / disco_rlLinks
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
☆228Updated last week
Alternatives and similar repositories for disco_rl
Users that are interested in disco_rl are comparing it to the libraries listed below
Sorting:
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆270Updated 7 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆117Updated 4 months ago
- ☆74Updated last year
- ☆87Updated 2 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆129Updated last year
- off-policy RL on long sequences☆146Updated 2 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆60Updated last year
- ☆110Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 11 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆61Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Overcooked human-AI experiment platform☆39Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Updated last year
- Repo for Implicit Diffusion Q-Learning☆116Updated last year
- [NeurIPS 2024] Official Implementation of Meta-DT☆46Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆95Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated last year
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.☆28Updated 4 months ago
- Online Decision Transformer☆272Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated 2 years ago
- On-Policy Policy Gradient Algorithms in JAX☆40Updated last year
- Official code repository for Prompt-DT.☆116Updated 3 years ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆51Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆76Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)☆98Updated last month
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆243Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithms☆115Updated 2 weeks ago
- Author's PyTorch implementation of TD7 for online and offline RL☆151Updated 2 years ago