jhejna / few-shot-preference-rlLinks
☆35Updated 2 years ago
Alternatives and similar repositories for few-shot-preference-rl
Users that are interested in few-shot-preference-rl are comparing it to the libraries listed below
Sorting:
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆60Updated last year
- Official code repository for Prompt-DT.☆116Updated 3 years ago
- ☆51Updated 2 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆28Updated last year
- An unofficial implementation for online decision transformer☆40Updated 3 years ago
- ☆29Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆89Updated 10 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆74Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆30Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆116Updated 3 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆66Updated last year
- ☆89Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 4 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- My Body Is A Cage☆41Updated 4 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆68Updated 4 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆42Updated 3 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- ☆37Updated last year
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆27Updated 3 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆36Updated last year
- ☆48Updated 2 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Updated 4 years ago