yanxue7 / RL-LLM-PriorLinks
☆23Updated 4 months ago
Alternatives and similar repositories for RL-LLM-Prior
Users that are interested in RL-LLM-Prior are comparing it to the libraries listed below
Sorting:
- SocialJax: sequential social dilemma environments☆49Updated last month
 - Object Centric Atari games☆93Updated this week
 - Partially Observable Process Gym☆203Updated 4 months ago
 - A tool for aggregating and plotting MARL experiment data.☆79Updated 9 months ago
 - ☆240Updated 11 months ago
 - Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆76Updated last year
 - Benchmarking RL generalization in an interpretable way.☆166Updated 2 weeks ago
 - MR.Q is a general-purpose model-free reinforcement learning algorithm.☆117Updated 4 months ago
 - Unified Implementations of Offline Reinforcement Learning Algorithms☆115Updated 3 weeks ago
 - Fast and flexible multi-agent gridworld reinforcement learning environments.☆44Updated 7 months ago
 - Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
 - Datasets with baselines for Offline MARL.☆182Updated this week
 - [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆39Updated 4 months ago
 - Simple single-file baselines for Q-Learning in pure-GPU setting☆188Updated 7 months ago
 - Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆337Updated last year
 - Author's PyTorch implementation of TD7 for online and offline RL☆151Updated 2 years ago
 - Gridworld domains in the gym interface☆29Updated last year
 - Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
 - This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
 - Challenging Memory-based Deep Reinforcement Learning Agents☆104Updated last year
 - Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆135Updated last year
 - Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆34Updated last year
 - ☆109Updated 8 months ago
 - [NeurIPS 2023] Implementation of Elastic Decision Transformer☆35Updated 2 years ago
 - Representation Learning for RL☆127Updated 2 years ago
 - Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication☆228Updated last week
 - ☆42Updated 2 years ago
 - LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆37Updated 2 years ago
 - Evaluating long-term memory of reinforcement learning algorithms☆150Updated 2 years ago
 - Deep reinforcement learning without experience replay, target networks, or batch updates.☆270Updated 7 months ago