An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
☆79Aug 22, 2024Updated last year
Alternatives and similar repositories for LLM4RL
Users that are interested in LLM4RL are comparing it to the libraries listed below.
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆53Apr 19, 2024Updated 2 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆278Oct 27, 2025Updated 6 months ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆22Jun 6, 2024Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆248Dec 11, 2025Updated 4 months ago
- [IROS2023] Learning to Solve Tasks with Exploring Prior Behaviours☆12Mar 3, 2024Updated 2 years ago
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- ☆47Jan 29, 2024Updated 2 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 2 months ago
- ☆25Aug 21, 2024Updated last year
- ☆19Apr 2, 2024Updated 2 years ago
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- Instruction Following Agents with Multimodal Transformers☆54Nov 3, 2022Updated 3 years ago
- Dynamic Simulation Environments for Reinforcement Learning☆13Apr 17, 2021Updated 5 years ago
- Convert CVXPY expressions to PyTorch expressions☆18Jul 8, 2025Updated 9 months ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆12May 30, 2023Updated 2 years ago
- ☆35Apr 22, 2020Updated 6 years ago
- Code accompanying our NeurIPS 2021 traffic4cast challenge☆27Sep 16, 2022Updated 3 years ago
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 4 years ago
- A program implemented in PyTorch for solving RCPSP and RCPSP with resource disruptions, based on graph neural network and reinforcement l…☆17Sep 20, 2023Updated 2 years ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆38Jun 14, 2024Updated last year
- An RL benchmark framework based on real-world problems☆13Jun 28, 2023Updated 2 years ago
- ☆16Jul 1, 2021Updated 4 years ago
- ☆14May 21, 2024Updated last year
- ☆24Oct 26, 2021Updated 4 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆44Dec 1, 2025Updated 5 months ago
- Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning☆12Dec 20, 2020Updated 5 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Oct 6, 2024Updated last year
- ☆12Mar 15, 2022Updated 4 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- Improving upon state-of-the-art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆33Apr 14, 2024Updated 2 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated 2 years ago
- ☆17Aug 6, 2021Updated 4 years ago
- Berkeley Deep Drive-X (eXplanation) dataset☆132Jan 18, 2019Updated 7 years ago
- Chain-of-Thought Predictive Control☆56May 1, 2023Updated 3 years ago