machine-intelligence/rl-teacher-atari

(This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficiently collecting human feedback.

☆29

Alternatives and similar repositories for rl-teacher-atari

Users who are interested in rl-teacher-atari are comparing it to the libraries listed below.

mschweizer / Pref-RL
Pref-RL provides ready-to-use PbRL agents that are easily extensible.
☆11 · Aug 31, 2022 · Updated 3 years ago
causalincentives / pycid
Library for graphical models of decision making, based on pgmpy and networkx
☆113 · Sep 19, 2023 · Updated 2 years ago
lili-chen / SEER
Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.
☆21 · Mar 5, 2021 · Updated 5 years ago
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆46 · May 9, 2019 · Updated 7 years ago
holken / polite
code for polite
☆11 · Feb 28, 2024 · Updated 2 years ago
JunhongXu / ppo-pytorch
☆20 · Apr 10, 2018 · Updated 8 years ago
SpiNNakerManchester / SpiNNFrontEndCommon
Common support code for user-facing front end systems.
☆12 · May 14, 2026 · Updated last week
ryan-p-randall / monthly-planning-files
Text files to help plan & log whatever it is you do. Bullet journal + pomodoro technique + text editors + cloud syncing = progress.
☆16 · Aug 7, 2021 · Updated 4 years ago
tanmayshankar / AIR_papers
Database of Artificial Intelligence and Robotics papers.
☆12 · Jul 11, 2016 · Updated 9 years ago
solislemuslab / tropical-stethoscope
Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021)
☆13 · Oct 16, 2023 · Updated 2 years ago
ymetz / rlhfblender
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
☆14 · Updated this week
RomeoV / ml4co-competition
Winner of NeurIPS 2021 student leaderboard. Self-bootstrapping Bayesian optimization for SCIP configuration using GNNs.
☆14 · Oct 28, 2022 · Updated 3 years ago
duchesneaumathieu / pyperlin
GPU accelerated Perlin Noise in Python
☆11 · Oct 23, 2020 · Updated 5 years ago
jangirrishabh / HER-learn-InverseKinematics
Learning Inverse Kinematics of a Barrett WAM Robotic arm in Gazebo simulation
☆11 · Jun 7, 2018 · Updated 7 years ago
EleutherAGI / summarisation
The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…
☆12 · Jul 14, 2021 · Updated 4 years ago
savya08 / REN
Region Encoder Network
☆21 · Oct 2, 2025 · Updated 7 months ago
rll-research / rune
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15 · May 26, 2022 · Updated 3 years ago
HumanCompatibleAI / seals
Benchmark environments for reward modelling and imitation learning algorithms.
☆46 · Sep 19, 2023 · Updated 2 years ago
EvgenyKashin / stylegan2
StyleGAN2 - Official TensorFlow Implementation with practical improvements
☆11 · Apr 17, 2020 · Updated 6 years ago
mcd4874 / Recommendation_system_using_RL_RecSim
Explore the potential of recommendation systems using reinforcement learning
☆15 · Apr 23, 2020 · Updated 6 years ago
ZucksLiu / OCTCubeM
OCTCube-M: A 3D multimodal optical coherence tomography foundation model for retinal and systemic diseases with cross-cohort and cross-de…
☆31 · May 12, 2026 · Updated last week
chhwang / cmcl
This code is for the paper "Confident Multiple Choice Learning".
☆17 · Aug 4, 2018 · Updated 7 years ago
jhejna / few-shot-preference-rl
☆37 · Apr 27, 2023 · Updated 3 years ago
txzhao / rl-zoo
PyTorch implementation of various reinforcement learning algorithms
☆18 · Feb 22, 2018 · Updated 8 years ago
typoverflow / WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21 · Mar 24, 2025 · Updated last year
BINDS-LAB-UMASS / snn-minibatch
Repository for code, data, and other artifacts for "Minibatch Processing in Spiking Neural Networks"
☆14 · Nov 5, 2019 · Updated 6 years ago
yixiao1 / Action-Based-Representation-Learning
☆14 · Nov 23, 2022 · Updated 3 years ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆27 · Mar 30, 2023 · Updated 3 years ago
geyang / e-maml
E-MAML and RL-MAML baseline implemented in TensorFlow v1
☆17 · Dec 7, 2019 · Updated 6 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39 · Oct 27, 2020 · Updated 5 years ago
bmazoure / ppo_jax
JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆60 · Aug 4, 2022 · Updated 3 years ago
srohit0 / food_mnist
This dataset has 10 food categories, with 5,000 images. For each class, 125 manually reviewed test images are provided as well as 375 tra…
☆11 · Jun 22, 2019 · Updated 6 years ago
nikaashpuri / sarfa-saliency
☆36 · Jan 20, 2023 · Updated 3 years ago
Francesco-Sovrano / Generic-Hierarchical-Deep-Reinforcement-Learning-for-Sentiment-Analysis
A3C and generic hierarchical RL for sentiment analysis tasks
☆15 · Dec 1, 2019 · Updated 6 years ago
victordibia / cocoafrica
A Curation Tool and Dataset of Common Objects in the Context of Africa
☆18 · May 1, 2023 · Updated 3 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17 · Dec 8, 2022 · Updated 3 years ago
mohmdelsayed / upgd
A continual learning optimizer mitigating catastrophic forgetting and loss of plasticity
☆26 · Oct 14, 2024 · Updated last year
tldr-group / HR-Dv2
Extracting high-resolution DINOv2 features for data-scarce applications like materials segmentation.
☆29 · Nov 10, 2025 · Updated 6 months ago
duskvirkus / colab-notebooks
A collection of my machine learning notebooks to run on Google Colab. Mostly ML art.
☆20 · Jun 10, 2022 · Updated 3 years ago