An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
☆79Aug 22, 2024Updated last year
Alternatives and similar repositories for LLM4RL
Users that are interested in LLM4RL are comparing it to the libraries listed below.
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with a Large Language Model☆54Apr 19, 2024Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆277Oct 27, 2025Updated 5 months ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆23Jun 6, 2024Updated last year
- [IROS2023]Learning to Solve Tasks with Exploring Prior Behaviours☆12Mar 3, 2024Updated 2 years ago
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- ☆47Jan 29, 2024Updated 2 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 2 months ago
- ☆25Aug 21, 2024Updated last year
- ☆19Apr 2, 2024Updated 2 years ago
- Instruction Following Agents with Multimodal Transformers☆54Nov 3, 2022Updated 3 years ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆415Jan 7, 2026Updated 3 months ago
- Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety.☆20May 22, 2022Updated 3 years ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆59Dec 3, 2025Updated 4 months ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- Code repo for MathAgent☆20Dec 15, 2023Updated 2 years ago
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 4 years ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆38Jun 14, 2024Updated last year
- [ICRA 2026] Official implementation of the paper "GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation"☆172Feb 27, 2026Updated last month
- An RL benchmark framework based on real-world problems☆13Jun 28, 2023Updated 2 years ago
- ☆16Jul 1, 2021Updated 4 years ago
- ☆22Mar 16, 2026Updated 3 weeks ago
- ☆13May 21, 2024Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Oct 14, 2024Updated last year
- Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning☆12Dec 20, 2020Updated 5 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Oct 6, 2024Updated last year
- ☆12Mar 15, 2022Updated 4 years ago
- ViViDex implementation under the SAPIEN simulator, ICRA 2025☆19Apr 9, 2025Updated last year
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆28Apr 4, 2024Updated 2 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆33Apr 14, 2024Updated 2 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated last year
- Highway-Env Agent using DQN☆19May 29, 2022Updated 3 years ago
- Berkeley Deep Drive-X (eXplanation) dataset☆131Jan 18, 2019Updated 7 years ago
- Chain-of-Thought Predictive Control☆56May 1, 2023Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆57Dec 27, 2023Updated 2 years ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆30Oct 12, 2023Updated 2 years ago