Using Natural Language for Reward Shaping in Reinforcement Learning
☆24Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for rl-learn
Users that are interested in rl-learn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- ☆20May 1, 2024Updated 2 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Source files for the 2020 ICAPS Online Summer School Lab on Plan Execution.☆11Oct 16, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 2 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆29Dec 30, 2021Updated 4 years ago
- Cornell Instruction Following Framework☆34Oct 11, 2021Updated 4 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆14Aug 17, 2023Updated 2 years ago
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Dec 18, 2019Updated 6 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆18Sep 17, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Nov 16, 2020Updated 5 years ago
- Landmark-Based Approaches for Goal Recognition as Planning.☆14Oct 17, 2025Updated 6 months ago
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- RL agent using private and shared world models☆11Jun 12, 2023Updated 2 years ago
- CS234 Project, Winter 2019☆10Mar 20, 2019Updated 7 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 4 years ago
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation"☆22Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 4 years ago
- Repository with data and code for the paper "Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series …☆22Jul 13, 2025Updated 9 months ago
- ☆12Mar 24, 2021Updated 5 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆11Jan 3, 2023Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆41Aug 27, 2018Updated 7 years ago
- Code and real data for "Counterfactual Temporal Point Processes", NeurIPS 2022☆16Sep 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- DIgital Twin of Energy Storage System☆14Mar 16, 2026Updated last month
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 8 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 7 years ago
- RL for Energy Management of Microgrids☆11Mar 28, 2020Updated 6 years ago
- We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.☆29Mar 30, 2021Updated 5 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆37May 3, 2020Updated 6 years ago
- R-VQA: Visual Question Answering with Relation Facts☆19May 11, 2021Updated 4 years ago