yanxue7 / RL-LLM-Prior
☆23 Updated 3 months ago
Alternatives and similar repositories for RL-LLM-Prior
Users that are interested in RL-LLM-Prior are comparing it to the libraries listed below
- SocialJax: sequential social dilemma environments ☆45 Updated 2 weeks ago
- Benchmarking RL generalization in an interpretable way. ☆162 Updated 3 months ago
- Object Centric Atari games ☆89 Updated 3 weeks ago
- ☆237 Updated 10 months ago
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni… ☆71 Updated last year
- A tool for aggregating and plotting MARL experiment data. ☆77 Updated 8 months ago
- Partially Observable Process Gym ☆199 Updated 3 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆111 Updated 2 months ago
- Datasets with baselines for Offline MARL. ☆178 Updated 3 weeks ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize… ☆43 Updated last year
- Unified Implementations of Offline Reinforcement Learning Algorithms ☆95 Updated 4 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆48 Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments. ☆44 Updated 5 months ago
- Off-policy RL on long sequences ☆141 Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆182 Updated 6 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆148 Updated 2 years ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext… ☆125 Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight. ☆186 Updated 4 months ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022 ☆330 Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆35 Updated 2 years ago
- ☆282 Updated 3 years ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning ☆37 Updated 3 months ago
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023) ☆108 Updated last year
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning" ☆33 Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ☆148 Updated 2 years ago
- ☆83 Updated 2 years ago
- ☆31 Updated 2 years ago
- Official code repository for Prompt-DT. ☆115 Updated 3 years ago
- ☆43 Updated 2 years ago
- ☆105 Updated 6 months ago