dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆127Updated this week
Alternatives and similar repositories for awesome-in-context-rl:
Users that are interested in awesome-in-context-rl are comparing it to the libraries listed below
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆65Updated 2 weeks ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆71Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆29Updated 5 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆39Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 6 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆54Updated last year
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆82Updated last year
- JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️☆264Updated 3 months ago
- ☆215Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆93Updated 4 months ago
- ☆79Updated 8 months ago
- Official implementation of the BRO algorithm☆38Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU setting☆138Updated 3 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆111Updated last week
- Efficient baselines for autocurricula in JAX.☆181Updated 6 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆59Updated 9 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆159Updated last year
- ☆70Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆126Updated 3 months ago
- Accelerated minigrid environments with JAX☆132Updated 7 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆41Updated 7 months ago
- Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (M…☆25Updated 3 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆54Updated 4 months ago
- Official code repository for Prompt-DT.☆106Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.☆71Updated last month
- Synthetic Experience Replay☆86Updated 9 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆63Updated 9 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆216Updated this week
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆48Updated last year