yuqingd / ellmLinks
☆88Updated 2 years ago
Alternatives and similar repositories for ellm
Users that are interested in ellm are comparing it to the libraries listed below
Sorting:
- Official code repository for Prompt-DT.☆119Updated 3 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆50Updated last year
- Implementation of TWOSOME☆82Updated 11 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆166Updated 2 years ago
- Overcooked human-AI experiment platform☆39Updated 2 years ago
- ☆42Updated 2 years ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆383Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Updated 2 months ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆132Updated 4 years ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆192Updated last year
- ☆14Updated 2 years ago
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆32Updated 10 months ago
- ☆40Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆143Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆80Updated last year
- off-policy RL on long sequences☆155Updated 4 months ago
- ☆115Updated 2 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆28Updated 4 months ago
- Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)☆23Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆25Updated 3 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated 2 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆18Updated last year
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated 2 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆135Updated last year
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆129Updated 2 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆51Updated 8 months ago
- Online Decision Transformer☆274Updated last year