yuqingd / ellmLinks
☆88Updated 2 years ago
Alternatives and similar repositories for ellm
Users that are interested in ellm are comparing it to the libraries listed below
Sorting:
- Official code repository for Prompt-DT.☆118Updated 3 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆50Updated last year
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆382Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Implementation of TWOSOME☆82Updated 10 months ago
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆32Updated 9 months ago
- Overcooked human-AI experiment platform☆39Updated last year
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆26Updated 3 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆165Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆132Updated 4 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆144Updated 2 years ago
- off-policy RL on long sequences☆152Updated 3 months ago
- ☆42Updated 2 years ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆190Updated 11 months ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆32Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- ☆63Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆80Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆49Updated 8 months ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆18Updated last year
- ☆39Updated last year
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆130Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated last year
- Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)☆23Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Updated last month
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆129Updated 2 years ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆39Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated 2 years ago
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago