yuqingd / ellm
☆67Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ellm
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- Implementation of TWOSOME☆49Updated 6 months ago
- Overcooked human-AI experiment platform☆30Updated 11 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆151Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆43Updated last year
- ☆22Updated 10 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆91Updated this week
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆63Updated 2 months ago
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆129Updated 3 weeks ago
- ☆86Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆54Updated 5 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆80Updated last year
- ☆12Updated 8 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated last month
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆38Updated 3 weeks ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆173Updated 2 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆93Updated 2 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆38Updated 7 months ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆114Updated 3 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆90Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆19Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated 8 months ago
- Transformer-based World Models☆71Updated last year
- ☆106Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆220Updated 2 months ago
- ☆26Updated last year
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆29Updated last year
- ☆11Updated 8 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- ☆11Updated 10 months ago