Learning From Human Preferences - Tensorflow+Keras Implementation
☆18Aug 17, 2017Updated 8 years ago
Alternatives and similar repositories for LearningFromHumanPreferences
Users that are interested in LearningFromHumanPreferences are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆335Nov 29, 2021Updated 4 years ago
- Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback☆562Jan 24, 2023Updated 3 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Sep 1, 2022Updated 3 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- An optimized version of SeqGAN in pytorch☆12Apr 24, 2018Updated 7 years ago
- ☆19Mar 28, 2019Updated 6 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network☆12May 30, 2018Updated 7 years ago
- (Experimental) ROS packages for Blue + Gazebo☆15Aug 4, 2019Updated 6 years ago
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 9 months ago
- PowerShell によって Windows10 のキッティングに必要な全工程を自動的に完了。☆12Jun 10, 2025Updated 9 months ago
- VQ-TR repository☆12Apr 18, 2024Updated last year
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Utility for adding archive.org links to markdown files in the format [...](original link) ([a](archive.org link))☆21Aug 6, 2025Updated 7 months ago
- ☆13Jan 9, 2018Updated 8 years ago
- Notes for my Calculus courses in college, written in Jupyter Notebooks☆12Jul 31, 2016Updated 9 years ago
- My Data Provider: A minimal multi-exchange data providing project to feed trading algorithms/bots. Built with Python and FastAPI.☆12May 30, 2024Updated last year
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Open AI gym environment for the Baxter robot☆14Oct 6, 2016Updated 9 years ago
- Interactive, web-based visual math assistant☆12Mar 14, 2026Updated last week
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Handeye calibration for FR3 & Realsense with Ros2. Using Ros2 Humble, easy_handeye2, ros2_aruco.☆20Jun 4, 2025Updated 9 months ago
- A set of demos using a Pioneer robot and based on ViSP☆14Jun 4, 2019Updated 6 years ago
- Tutorials for the Robotics MVA 2023 class☆11Aug 1, 2024Updated last year
- ☆10Aug 26, 2022Updated 3 years ago
- Skill-based Teleoperation☆44Dec 4, 2025Updated 3 months ago
- A powerful text cleaner for Japanese web texts☆12Jan 20, 2024Updated 2 years ago
- Collection of scripts for visualizing high dimensional data with scikit-learn and bh_tsne☆34Aug 22, 2015Updated 10 years ago
- Probabilistic Motion Primives library☆13Dec 14, 2022Updated 3 years ago
- ☆13Mar 2, 2018Updated 8 years ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Material from M1P1, formalised in Lean☆15Nov 2, 2019Updated 6 years ago
- https://mth229.github.io☆13Mar 5, 2026Updated 2 weeks ago
- Serverless Scraper for Cryptocurrency Order Book Data☆15Dec 8, 2022Updated 3 years ago
- Prototyping mujoco simulation environments.☆11Feb 20, 2025Updated last year
- ROBEL: Robotics Benchmarks for Learning with low-cost robots (dev fork)☆12Jul 30, 2020Updated 5 years ago
- ☆14Mar 6, 2018Updated 8 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018☆66Nov 17, 2021Updated 4 years ago