lych1233 / GAMMA-human-ai-collaborationLinks
☆11Updated 2 weeks ago
Alternatives and similar repositories for GAMMA-human-ai-collaboration
Users that are interested in GAMMA-human-ai-collaboration are comparing it to the libraries listed below
Sorting:
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Updated 4 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆21Updated 3 weeks ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 3 years ago
- solver for discrete Mixed Observable Markov Decision Processes☆11Updated 5 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Updated 4 years ago
- ☆10Updated 3 years ago
- Inverse Constrained Reinforcement Learning (ICML 2021)☆25Updated 4 years ago
- ☆19Updated 2 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym.☆12Updated 3 years ago
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆13Updated 2 years ago
- ☆12Updated 3 years ago
- MBRL library in JAX☆12Updated 3 years ago
- ☆17Updated 3 years ago
- ☆16Updated 4 years ago
- ☆41Updated 2 years ago
- Representation Learning in RL☆13Updated 3 years ago
- ☆19Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Updated 2 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆53Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Updated last year
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆94Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 4 years ago
- ☆11Updated 4 years ago
- Benchmark data for d3rlpy☆21Updated 2 years ago
- ☆16Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆67Updated 2 years ago