yanxue7 / RL-LLM-PriorLinks
☆24Updated 7 months ago
Alternatives and similar repositories for RL-LLM-Prior
Users that are interested in RL-LLM-Prior are comparing it to the libraries listed below
Sorting:
- Benchmarking RL generalization in an interpretable way.☆174Updated last month
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆80Updated last year
- SocialJax: sequential social dilemma environments☆60Updated last month
- Object Centric Atari games☆96Updated last month
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆130Updated 6 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆189Updated 3 weeks ago
- ☆248Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.☆80Updated 11 months ago
- Partially Observable Process Gym☆211Updated 7 months ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆34Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆48Updated 9 months ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆159Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆231Updated last month
- Datasets with baselines for Offline MARL.☆196Updated 2 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆30Updated 4 months ago
- ☆42Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆217Updated last month
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆341Updated last year
- Synthetic Experience Replay☆107Updated last year
- Representation Learning for RL☆129Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- ☆306Updated 3 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆165Updated last month
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated 2 years ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆38Updated 2 years ago
- This is a repository for Hidden-utility Self-Play.☆26Updated 2 years ago