jys5609 / MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
☆32Updated last year
Alternatives and similar repositories for MC-LAVE-RL:
Users that are interested in MC-LAVE-RL are comparing it to the libraries listed below
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated 3 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆53Updated 6 months ago
- ☆26Updated last year
- ☆17Updated last month
- Implementation of Multi-Game Decision Transformers in PyTorch☆45Updated last year
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆42Updated 2 years ago
- Official code repository for Prompt-DT.☆102Updated 2 years ago
- ☆18Updated last year
- ☆29Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆157Updated last year
- Official PyTorch implementation of AlberDICE☆22Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆41Updated 6 months ago
- ☆76Updated 6 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- Clean, extensible implementation of MACAW [ICML 2021]☆13Updated 3 years ago
- ☆11Updated 10 months ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆17Updated 2 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆44Updated 3 years ago
- ☆46Updated 2 years ago
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆35Updated 2 weeks ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆60Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆54Updated 10 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆12Updated last year
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆22Updated 5 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last month
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆31Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year