Implementation of the TAMER algorithm from "Interactively Shaping Agents via Human Reinforcement" (Knox, Stone - 2009)
☆21May 6, 2020Updated 6 years ago
Alternatives and similar repositories for TAMER
Users that are interested in TAMER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Oct 3, 2023Updated 2 years ago
- MIDS Capstone Project, Duke University, 2021☆10Apr 28, 2021Updated 5 years ago
- IRL implementation based on Norvig's AIMA code.☆14May 2, 2014Updated 12 years ago
- PreferenceNet: Encoding Human Preferences in Auction Design With Deep Learning☆17Aug 10, 2021Updated 4 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆337Nov 29, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- This package provides a pythonic interface to the Evocortex libirimager direct binding.☆10Feb 16, 2023Updated 3 years ago
- Experiments from "The Generalization-Stability Tradeoff in Neural Network Pruning": https://arxiv.org/abs/1906.03728.☆14Oct 23, 2020Updated 5 years ago
- ☆14Nov 4, 2019Updated 6 years ago
- Distributed RL Algorithms for Dynamic Energy Pricing in Microgrids☆18Jun 7, 2021Updated 5 years ago
- Pluggin and utils for viewing voxelgrids in RViz☆11Jun 14, 2021Updated 4 years ago
- Independent Component Analysis (or other spacial recomposition) embedded in neural layers☆12Jan 7, 2018Updated 8 years ago
- ☆10Aug 23, 2022Updated 3 years ago
- 2022DCIC-基于文本字符的交易验证码识别☆19Feb 16, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- reinforcement learning algorithm for multi-objective optimization problem☆18Dec 19, 2020Updated 5 years ago
- A functional Telnet server written in C#☆15Sep 19, 2020Updated 5 years ago
- Code of the paper "Interactive Learning of Temporal Feature for Control", published in the IEEE Robotics & Automation Magazine.☆12Dec 27, 2022Updated 3 years ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning☆22Sep 23, 2022Updated 3 years ago
- ☆37Apr 27, 2023Updated 3 years ago
- ☆22Dec 18, 2023Updated 2 years ago
- 刷题代码 整理总结☆21Sep 7, 2020Updated 5 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆18Jun 18, 2024Updated last year
- TFW: Annotated Thermal Faces in the Wild Dataset☆26Sep 10, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ROS drivers for Optris thermal imagers☆25Jul 29, 2020Updated 5 years ago
- ☆15Jun 6, 2023Updated 3 years ago
- Initial commit☆13Aug 14, 2023Updated 2 years ago
- Generic whole body control library with QP: inverse dynamics and kinematics☆20Jul 14, 2023Updated 2 years ago
- RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.☆14May 26, 2023Updated 3 years ago
- Neocortex Unity SDK for Smart NPCs and Virtual Assistants☆34Mar 29, 2026Updated 2 months ago
- Toy implementations of CNNs☆28Dec 24, 2020Updated 5 years ago
- This repository features game simulations as machine learning environments to experiment with deep learning approaches such as deep reinf…☆32Jun 25, 2018Updated 7 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆43Jul 20, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- GibsonSim2RealChallenge @ CVPR2020☆36May 26, 2020Updated 6 years ago
- Jupyter notebook containing a solution to Sutton and Barto's gridworld problem with both a random agent and a Q-learning agent.☆33Feb 23, 2018Updated 8 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback☆14May 19, 2026Updated 3 weeks ago
- CbirByVgg-基于VGG16的图像检索程序,可视化查询并显示,可扩展到自己的图像库。☆22Jan 9, 2022Updated 4 years ago
- Self-supervised learning for EEG☆28Sep 2, 2020Updated 5 years ago
- Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition☆14Dec 22, 2022Updated 3 years ago
- This project models cascading failures in power system. Then it uses reinforcement learning to identify critical failure paths to avoid t…☆26May 26, 2021Updated 5 years ago