☆90Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for ellm
Users that are interested in ellm are comparing it to the libraries listed below.
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with a Large Language Model☆53Apr 19, 2024Updated 2 years ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Oct 27, 2025Updated 8 months ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated 5 months ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- Benchmarking the Spectrum of Agent Capabilities☆566Jan 23, 2024Updated 2 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆19Aug 9, 2024Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆50Mar 13, 2025Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- [KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation☆34Nov 18, 2025Updated 7 months ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆19Aug 20, 2023Updated 2 years ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆556Nov 17, 2025Updated 7 months ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆22May 28, 2025Updated last year
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe…☆25Feb 10, 2024Updated 2 years ago
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.☆11Feb 6, 2023Updated 3 years ago
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks☆129Aug 21, 2024Updated last year
- ☆119Apr 15, 2023Updated 3 years ago
- ☆18May 14, 2026Updated last month
- ☆22Oct 12, 2024Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆45Oct 31, 2024Updated last year
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆208Dec 17, 2024Updated last year
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 9 months ago
- Verlog: A Multi-turn RL framework for LLM agents☆73Apr 28, 2026Updated 2 months ago
- ☆54Feb 25, 2026Updated 4 months ago
- ☆28Apr 26, 2024Updated 2 years ago
- Deep Learning (FS 2020)☆17Oct 10, 2022Updated 3 years ago
- This is a repository for Hidden-utility Self-Play.☆27Jul 27, 2023Updated 2 years ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆42Sep 20, 2025Updated 9 months ago
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆14Mar 24, 2025Updated last year
- Amazon Alexa Voice Controlled Drone☆10Jan 18, 2020Updated 6 years ago
- A benchmark to evaluate situated inductive reasoning☆16Jan 7, 2025Updated last year
- A Probabilistic Load Forecasting Project. Mirror of https://git.rwth-aachen.de/acs/public/automation/plf/proloaf☆16Apr 2, 2026Updated 3 months ago
- Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper☆22Jul 14, 2024Updated last year
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆108Aug 11, 2024Updated last year
- decision-making processes of human drivers☆14Mar 28, 2024Updated 2 years ago
- Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety.☆20May 22, 2022Updated 4 years ago