Using Natural Language for Reward Shaping in Reinforcement Learning
☆24Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for rl-learn
Users that are interested in rl-learn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- ☆13Jul 3, 2023Updated 2 years ago
- ☆10Apr 23, 2021Updated 5 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- An implementation of the Conflict-Based Search, written in Python 3. This project, however, will support weighted edges and uncertainty r…☆11Jun 13, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆29Dec 30, 2021Updated 4 years ago
- Cornell Instruction Following Framework☆34Oct 11, 2021Updated 4 years ago
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Dec 18, 2019Updated 6 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆18Sep 17, 2025Updated 8 months ago
- ☆18Nov 16, 2020Updated 5 years ago
- Landmark-Based Approaches for Goal Recognition as Planning.☆14Oct 17, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a wavelet-based multifractal image analysis tool implementing the WTMM (Wavelet Transform Modulus Maxima) method.☆11Feb 1, 2020Updated 6 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- RL agent using private and shared world models☆11Jun 12, 2023Updated 2 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 4 years ago
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation"☆22Dec 8, 2022Updated 3 years ago
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 5 years ago
- Repository with data and code for the paper "Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series …☆22Jul 13, 2025Updated 10 months ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆11Jan 3, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆41Aug 27, 2018Updated 7 years ago
- Code and real data for "Counterfactual Temporal Point Processes", NeurIPS 2022☆16Sep 26, 2022Updated 3 years ago
- DIgital Twin of Energy Storage System☆15Mar 16, 2026Updated 2 months ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 8 years ago
- RL for Energy Management of Microgrids☆11Mar 28, 2020Updated 6 years ago
- We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.☆29Mar 30, 2021Updated 5 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆37May 3, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- R-VQA: Visual Question Answering with Relation Facts☆19May 11, 2021Updated 5 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- ☆10Mar 5, 2024Updated 2 years ago
- Source code for the project Robust energy trading and scheduling for microgrids based on the Contract Collaboration Problem☆10Apr 29, 2022Updated 4 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated 2 years ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- ☆11Oct 30, 2017Updated 8 years ago