yanxue7 / RL-LLM-PriorLinks
☆23Updated 4 months ago
Alternatives and similar repositories for RL-LLM-Prior
Users that are interested in RL-LLM-Prior are comparing it to the libraries listed below
Sorting:
- SocialJax: sequential social dilemma environments☆47Updated 2 weeks ago
- Object Centric Atari games☆90Updated last month
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆112Updated 3 months ago
- ☆238Updated 10 months ago
- Benchmarking RL generalization in an interpretable way.☆165Updated 3 months ago
- Partially Observable Process Gym☆201Updated 4 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆112Updated 5 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆73Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- Datasets with baselines for Offline MARL.☆181Updated last month
- ☆43Updated 2 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆44Updated 6 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆103Updated 11 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆186Updated 6 months ago
- A tool for aggregating and plotting MARL experiment data.☆78Updated 8 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆266Updated 6 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆149Updated 2 years ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆37Updated 4 months ago
- A list of papers regarding generalization in (deep) reinforcement learning☆152Updated 2 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆33Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆147Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆190Updated last week
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆335Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆37Updated 2 years ago
- off-policy RL on long sequences☆145Updated last month
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆21Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆28Updated 5 months ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆124Updated 2 years ago