yuqingd / ellmLinks
☆78Updated last year
Alternatives and similar repositories for ellm
Users that are interested in ellm are comparing it to the libraries listed below
Sorting:
- Official code repository for Prompt-DT.☆111Updated 2 years ago
- Overcooked human-AI experiment platform☆37Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆165Updated last year
- ☆42Updated 7 months ago
- Implementation of TWOSOME☆73Updated 4 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆160Updated 5 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆58Updated 7 months ago
- ☆29Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆67Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆131Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆47Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆95Updated 10 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- ☆13Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆88Updated 2 years ago
- ☆89Updated 2 years ago
- ☆23Updated last year
- ☆61Updated 6 months ago
- CORRO code☆35Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆122Updated 3 years ago
- ☆28Updated last year
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆35Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆124Updated last week
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆44Updated last year
- ☆102Updated 2 years ago
- [NeurIPS 2023] Efficient Diffusion Policy☆102Updated last year
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆39Updated 8 months ago
- This is a repository for Hidden-utility Self-Play.☆26Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆95Updated 8 months ago