Learning From Human Preferences - Tensorflow+Keras Implementation
☆18Aug 17, 2017Updated 8 years ago
Alternatives and similar repositories for LearningFromHumanPreferences
Users that are interested in LearningFromHumanPreferences are comparing it to the libraries listed below
Sorting:
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆333Nov 29, 2021Updated 4 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆19Mar 28, 2019Updated 6 years ago
- Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback☆562Jan 24, 2023Updated 3 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- 「行動データの計算論モデリング」のサポートページです。☆11Mar 1, 2021Updated 5 years ago
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- ROS package to allow starting and stopping of rosbag recordings via service calls.☆10Mar 5, 2018Updated 7 years ago
- A python implementation of the COACH algorithm for the Cartpole problem in OpenAI gym.☆11Mar 15, 2019Updated 6 years ago
- An implementation of BiternionNets for ROS, ready to run on a robot.☆14Apr 11, 2018Updated 7 years ago
- Collect orderbook data from crypto exchanges and publish as GRPC☆13Jun 19, 2022Updated 3 years ago
- Probabilistic Motion Primives library☆13Dec 14, 2022Updated 3 years ago
- Public accompanying repository for Universite de Montreal's IFT 6757: Autnonomous Vehicles, Fall 2019.☆11Jun 21, 2022Updated 3 years ago
- Page Clipper Bookmarklet☆21Nov 14, 2015Updated 10 years ago
- A Battery Intraday Trading Engine, based on dynamic programming approximations, written in C++, wrapped for Python☆35Feb 5, 2026Updated 3 weeks ago
- Tutorials for the Robotics MVA 2023 class☆11Aug 1, 2024Updated last year
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting☆11Jan 9, 2023Updated 3 years ago
- ROBEL: Robotics Benchmarks for Learning with low-cost robots (dev fork)☆12Jul 30, 2020Updated 5 years ago
- Prototyping mujoco simulation environments.☆11Feb 20, 2025Updated last year
- ☆11Oct 6, 2020Updated 5 years ago
- Library for creating curves. Forked from https://github.com/stonneau/spline☆13Feb 13, 2026Updated 2 weeks ago
- A trading system in python with GUI extension in PYQT. Proposed accepted API : many including those in README.☆11Jun 10, 2020Updated 5 years ago
- ☆12Nov 22, 2017Updated 8 years ago
- Push-to-See: Learning Non-Prehensile Manipulation to Enhance Instance Segmentation via Deep Q-Learning☆13Sep 2, 2022Updated 3 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- ☆10Aug 26, 2022Updated 3 years ago
- Use Gaussian processes to estimate CNN classification uncertainty☆12Mar 3, 2018Updated 8 years ago
- ☆12Mar 2, 2018Updated 8 years ago
- VR Joystick Teleoperation for Isaac Lab with Meta Quest☆19May 10, 2025Updated 9 months ago
- Code for "PUMA: Deep Metric Imitation Learning for Stable Motion Primitives"☆15Oct 1, 2024Updated last year
- Material from M1P1, formalised in Lean☆15Nov 2, 2019Updated 6 years ago
- ur5 robot with robotiq parallel grippers for testing parallel grasping algorithms☆11Apr 10, 2016Updated 9 years ago
- Skill-based Teleoperation☆43Dec 4, 2025Updated 2 months ago
- Building/Packaging SLAM Libraries with conda☆13Apr 12, 2018Updated 7 years ago
- Python wrapper for MuJoCo physics simulation.☆12Feb 14, 2019Updated 7 years ago
- A collection of openFrameworks standalone apps for sending and receiving OSC between apps☆12Apr 11, 2017Updated 8 years ago
- Notes for my Calculus courses in college, written in Jupyter Notebooks☆12Jul 31, 2016Updated 9 years ago
- OpenAI Gym Environment for ROS.☆13Nov 1, 2017Updated 8 years ago