☆89Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for ellm
Users that are interested in ellm are comparing it to the libraries listed below
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆52Apr 19, 2024Updated last year
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Oct 27, 2025Updated 4 months ago
- ☆15Feb 25, 2026Updated last week
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆46Mar 13, 2025Updated 11 months ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated last month
- A benchmark for evaluating situated inductive reasoning☆15Jan 7, 2025Updated last year
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 5 months ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆13May 28, 2025Updated 9 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- Benchmarking the Spectrum of Agent Capabilities☆522Jan 23, 2024Updated 2 years ago
- We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe…☆25Feb 10, 2024Updated 2 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- Implementations of Curious Replay for model-based adaptation.☆43Jul 5, 2023Updated 2 years ago
- Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper☆21Jul 14, 2024Updated last year
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆18Aug 9, 2024Updated last year
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆19Aug 20, 2023Updated 2 years ago
- TaskMet: Task-driven Metric Learning for Model Learning☆20Feb 9, 2024Updated 2 years ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆30Sep 20, 2025Updated 5 months ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆199Dec 17, 2024Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- ☆25Aug 19, 2024Updated last year
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- This is a repository for Hidden-utility Self-Play.☆26Jul 27, 2023Updated 2 years ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆543Nov 17, 2025Updated 3 months ago
- ☆111Jul 2, 2024Updated last year
- Official Implementation of CL-ALFRED (ICLR'24)☆30Oct 24, 2024Updated last year
- ☆27Dec 14, 2023Updated 2 years ago
- Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety.☆20May 22, 2022Updated 3 years ago
- ☆26Apr 26, 2024Updated last year
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆107Aug 11, 2024Updated last year
- PowerBiMIP is an open-source, efficient bilevel mixed-integer programming (BiMIP) solver, with a special focus on applications in power a…☆34Feb 26, 2026Updated last week
- PWM: Policy Learning with Large World Models☆64Aug 4, 2025Updated 7 months ago
- ☆19Sep 22, 2025Updated 5 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆244Dec 11, 2025Updated 2 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆119Jul 2, 2024Updated last year
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks☆127Aug 21, 2024Updated last year