(This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficiently collecting human feedback.
☆29 · Jan 22, 2019 · Updated 7 years ago
Alternatives and similar repositories for rl-teacher-atari
Users that are interested in rl-teacher-atari are comparing it to the libraries listed below.
- Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback ☆563 · Jan 24, 2023 · Updated 3 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆336 · Nov 29, 2021 · Updated 4 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 · Mar 5, 2021 · Updated 5 years ago
- Reward Learning by Simulating the Past ☆46 · May 9, 2019 · Updated 7 years ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025) ☆16 · Sep 26, 2025 · Updated 9 months ago
- Common support code for user-facing front end systems. ☆12 · Jun 23, 2026 · Updated last week
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning ☆10 · Dec 8, 2022 · Updated 3 years ago
- Database of Artificial Intelligence and Robotics papers. ☆12 · Jul 11, 2016 · Updated 9 years ago
- Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021) ☆13 · Oct 16, 2023 · Updated 2 years ago
- ☆44 · Apr 5, 2023 · Updated 3 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback ☆14 · May 19, 2026 · Updated last month
- ESP32 mqtt component ☆11 · May 1, 2017 · Updated 9 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024) ☆18 · Jun 18, 2024 · Updated 2 years ago
- Learning Inverse Kinematics of a Barrett WAM Robotic arm in Gazebo simulation ☆11 · Jun 7, 2018 · Updated 8 years ago
- ☆11 · Oct 25, 2020 · Updated 5 years ago
- Benchmark environments for reward modelling and imitation learning algorithms. ☆46 · Sep 19, 2023 · Updated 2 years ago
- ☆37 · Apr 27, 2023 · Updated 3 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆21 · Mar 24, 2025 · Updated last year
- Repository for code, data, and other artifacts for "Minibatch Processing in Spiking Neural Networks" ☆14 · Nov 5, 2019 · Updated 6 years ago
- StyleGAN2 - Official TensorFlow Implementation ☆12 · Jul 15, 2020 · Updated 5 years ago
- Tools to use with Brian 2, in particular for visualization ☆22 · May 19, 2026 · Updated last month
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆61 · Aug 4, 2022 · Updated 3 years ago
- This dataset has 10 food categories, with 5,000 images. For each class, 125 manually reviewed test images are provided as well as 375 tra… ☆11 · Jun 22, 2019 · Updated 7 years ago
- msgpack-rpc + α for JavaScript language ☆13 · Mar 8, 2022 · Updated 4 years ago
- ☆36 · Jan 20, 2023 · Updated 3 years ago
- A3C and generic hierarchical RL for sentiment analysis tasks ☆15 · Dec 1, 2019 · Updated 6 years ago
- Self-implementation of DPPO (Distributed Proximal Policy Optimization) using TensorFlow ☆12 · Sep 1, 2017 · Updated 8 years ago
- Maddpg_flight code ☆10 · Jul 4, 2018 · Updated 7 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- Universal 3D Sample Software ☆14 · Sep 1, 2023 · Updated 2 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark. ☆33 · Jun 23, 2023 · Updated 3 years ago
- a continual learning optimizer mitigating catastrophic forgetting and loss of plasticity ☆27 · Oct 14, 2024 · Updated last year
- A simple example that shows how to write an X11 app on Linux in Swift using the new package manager ☆13 · Oct 5, 2017 · Updated 8 years ago
- Translation of the official Zephyr project documentation https://www.zephyrproject.org/doc/ ☆12 · Apr 15, 2016 · Updated 10 years ago
- This folder contains a simple implementation of a probabilistic neural network in Python. ☆25 · Feb 23, 2019 · Updated 7 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆54 · Oct 18, 2021 · Updated 4 years ago
- ☆27 · Apr 22, 2024 · Updated 2 years ago
- A curated list of self-taught materials including research blogs ☆16 · Dec 12, 2016 · Updated 9 years ago
- Bash script to download Google maps satellite imagery for a given coordinate & zoom level to a JPG file ☆19 · Apr 9, 2012 · Updated 14 years ago