Using Natural Language for Reward Shaping in Reinforcement Learning
☆24Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for rl-learn
Users that are interested in rl-learn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Free and Open Platform for AI-assisted Computing☆10May 19, 2019Updated 7 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- ☆13Jul 3, 2023Updated 2 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- An implementation of the Conflict-Based Search, written in Python 3. This project, however, will support weighted edges and uncertainty r…☆11Jun 13, 2020Updated 6 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆29Dec 30, 2021Updated 4 years ago
- A group of utilities useful for members of UTCS.☆13Nov 19, 2016Updated 9 years ago
- Cornell Instruction Following Framework☆35Oct 11, 2021Updated 4 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆15Aug 17, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Dec 18, 2019Updated 6 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆19Sep 17, 2025Updated 8 months ago
- ☆18Nov 16, 2020Updated 5 years ago
- Landmark-Based Approaches for Goal Recognition as Planning.☆14Oct 17, 2025Updated 7 months ago
- a wavelet-based multifractal image analysis tool implementing the WTMM (Wavelet Transform Modulus Maxima) method.☆11Feb 1, 2020Updated 6 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- RL agent using private and shared world models☆11Jun 12, 2023Updated 3 years ago
- CS234 Project, Winter 2019☆10Mar 20, 2019Updated 7 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation"☆22Dec 8, 2022Updated 3 years ago
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 5 years ago
- Repository with data and code for the paper "Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series …☆22Jul 13, 2025Updated 11 months ago
- ☆12Mar 24, 2021Updated 5 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆11Jan 3, 2023Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- ☆10Dec 9, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆41Aug 27, 2018Updated 7 years ago
- Code and real data for "Counterfactual Temporal Point Processes", NeurIPS 2022☆16Sep 26, 2022Updated 3 years ago
- DIgital Twin of Energy Storage System☆15Mar 16, 2026Updated 2 months ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 7 years ago
- ☆17Dec 21, 2020Updated 5 years ago
- RL for Energy Management of Microgrids☆11Mar 28, 2020Updated 6 years ago
- We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.☆29Mar 30, 2021Updated 5 years ago