justinjfu/diagnosing_qlearning

Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.

☆17

Alternatives and similar repositories for diagnosing_qlearning

Users who are interested in diagnosing_qlearning are comparing it to the repositories listed below.

seungyulhan / disc
☆10 · Aug 17, 2022 · Updated 3 years ago
yilundu / task_agnostic_dynamics_prior
Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning
☆12 · Jun 13, 2019 · Updated 7 years ago
AnujMahajanOxf / VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14 · Dec 1, 2019 · Updated 6 years ago
rail-berkeley / design-baselines
☆24 · Feb 16, 2022 · Updated 4 years ago
avisingh599 / cog
[CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
☆35 · Oct 28, 2020 · Updated 5 years ago
FirefoxMetzger / scikit-bot
Robotics in Python
☆13 · Feb 21, 2023 · Updated 3 years ago
rasoolfa / P3O
P3O paper code
☆30 · Aug 7, 2019 · Updated 6 years ago
google-research / deep_ope
☆88 · Jul 30, 2024 · Updated 2 years ago
anair13 / rlkit
Collection of reinforcement learning algorithms
☆16 · Oct 6, 2021 · Updated 4 years ago
orybkin / lexa-benchmark
☆42 · May 11, 2022 · Updated 4 years ago
young-geng / SimpleSAC
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15 · Sep 2, 2022 · Updated 3 years ago
Asap7772 / PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32 · Oct 26, 2022 · Updated 3 years ago
prasoongoyal / PixL2R
☆17 · Dec 21, 2020 · Updated 5 years ago
denkiwakame / arxiv2scrap
easy-to-use arXiv clipper for scrapbox
☆16 · Jan 12, 2019 · Updated 7 years ago
rail-berkeley / design-bench
☆53 · Feb 16, 2022 · Updated 4 years ago
CausalML / DoubleReinforcementLearningMDP
☆14 · May 15, 2025 · Updated last year
uncharted-technologies / robust-domain-randomization
Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"
☆12 · Nov 22, 2022 · Updated 3 years ago
frt03 / inference-based-rl
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)
☆20 · Oct 25, 2021 · Updated 4 years ago
srsohn / msgi
ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies
☆18 · Jul 16, 2020 · Updated 6 years ago
dgleaso / Stock-Binary-Classification-LSTM
Uses an LSTM to predict the next day's stock movement based on a sequence of previous days
☆13 · Mar 9, 2021 · Updated 5 years ago
jmribeiro / yaaf
Yet Another Agents Framework - An RL research-oriented framework for agent prototyping and evaluation
☆18 · Oct 9, 2023 · Updated 2 years ago
XanderJC / scalable-birl
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
☆47 · Mar 12, 2021 · Updated 5 years ago
AdityaMate / collapsing_bandits
Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)
☆11 · Dec 3, 2025 · Updated 7 months ago
SudeepDasari / one_shot_transformers
☆25 · Nov 10, 2020 · Updated 5 years ago
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
francidellungo / Minigrid_HCI-project
Train agents on MiniGrid from human demonstrations using Inverse Reinforcement Learning
☆13 · Apr 15, 2020 · Updated 6 years ago
aviralkumar2907 / MMCE
☆19 · Jun 5, 2018 · Updated 8 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26 · May 5, 2020 · Updated 6 years ago
perrin-isir / yomix
An interactive tool to explore low dimensional embeddings of omics data.
☆18 · Mar 27, 2026 · Updated 4 months ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 7 years ago
ozansener / RecipeWatch
☆12 · Jan 12, 2016 · Updated 10 years ago
theogruner / rl_pro_telu
☆23 · Jun 8, 2021 · Updated 5 years ago
alexlee-gk / slac
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆154 · Oct 26, 2020 · Updated 5 years ago
microsoft / roman
Python library for real-time control of a robotic manipulator
☆21 · Feb 7, 2023 · Updated 3 years ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61 · Aug 9, 2022 · Updated 3 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
syuntoku14 / pytorch-rl-il
A library for building reinforcement learning and imitation learning agents in Pytorch
☆61 · Jun 13, 2020 · Updated 6 years ago
paramrathour / Nonlinear-Dynamics
Files related to my Summer of Science Report on Nonlinear Dynamics
☆12 · Oct 11, 2023 · Updated 2 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 5 years ago