Implementation of the TAMER algorithm from "Interactively Shaping Agents via Human Reinforcement" (Knox, Stone - 2009)
☆21May 6, 2020Updated 5 years ago
Alternatives and similar repositories for TAMER
Users that are interested in TAMER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Oct 3, 2023Updated 2 years ago
- This repository contains the research project that enables the robot to automatically join a group based on the modeled personal, social …☆11Nov 4, 2018Updated 7 years ago
- This resource page includes the battery aging test simulator in Matlab-Simulink, auto simulator in Matlab, and the datasets for 1000+ bat…☆12Jan 26, 2023Updated 3 years ago
- A project that uses Reinforcement Learning (Q-Learning) to trade stock.☆10Apr 23, 2017Updated 8 years ago
- IRL implementation based on Norvig's AIMA code.☆14May 2, 2014Updated 11 years ago
- A* (A-Star) algorithm for finding the shortest path in a maze☆15Dec 8, 2020Updated 5 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆335Nov 29, 2021Updated 4 years ago
- ☆14Nov 4, 2019Updated 6 years ago
- Direct SDK bindings for Python☆12May 22, 2022Updated 3 years ago
- ☆11Apr 22, 2022Updated 3 years ago
- Source code for the IROS21 paper Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective☆11Aug 2, 2021Updated 4 years ago
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- Distributed RL Algorithms for Dynamic Energy Pricing in Microgrids☆18Jun 7, 2021Updated 4 years ago
- Pluggin and utils for viewing voxelgrids in RViz☆11Jun 14, 2021Updated 4 years ago
- ☆10Aug 23, 2022Updated 3 years ago
- 2022DCIC-基于文本字符的交易验证码识别☆19Feb 16, 2022Updated 4 years ago
- reinforcement learning algorithm for multi-objective optimization problem☆18Dec 19, 2020Updated 5 years ago
- A functional Telnet server written in C#☆15Sep 19, 2020Updated 5 years ago
- MVP for full stack BCI, P300, SSVEP, and MI☆17Mar 11, 2026Updated last week
- Code accompanying the manuscript: Van Baar, J., Chang, L., & Sanfey, A.G. (2019). The computational and neural substrates of moral strate…☆26Sep 29, 2022Updated 3 years ago
- ☆37Apr 27, 2023Updated 2 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- Awesome Self-Supervised Vision Learning☆11Mar 27, 2024Updated last year
- MiniTouch is a ServiceNow Research project that was started at Element AI.☆14Jul 5, 2023Updated 2 years ago
- Bayesian Inverse Reinforcement Learning with simple environments☆19May 17, 2022Updated 3 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Jun 18, 2024Updated last year
- TFW: Annotated Thermal Faces in the Wild Dataset☆24Sep 10, 2022Updated 3 years ago
- Implementation of the Paper: ICG-Net: A unified approach for instance centric grasping☆17Aug 22, 2025Updated 7 months ago
- Delving into the Continuous Domain Adaptation (ACM MM22)☆12Jul 10, 2022Updated 3 years ago
- ☆31Jun 16, 2022Updated 3 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos…☆16Oct 29, 2017Updated 8 years ago
- ☆11Apr 25, 2024Updated last year
- Python implementation of Self-Organizing Maps using PyTorch☆12Apr 20, 2018Updated 7 years ago
- This repository features game simulations as machine learning environments to experiment with deep learning approaches such as deep reinf…☆32Jun 25, 2018Updated 7 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Jul 20, 2024Updated last year
- 变分推断在教育测量模型中的应用(项目反应理论,认知诊断模型) variational inference for psychometrics model (item response theoy, cognitive diagnosis models)☆17Jul 6, 2023Updated 2 years ago
- Re-Implementation of Gaussian Process Latent Variable Model algorithm & performance assessment against Kernel-PCA☆14Oct 9, 2024Updated last year
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback☆13Mar 16, 2026Updated last week
- Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition☆14Dec 22, 2022Updated 3 years ago