tedmoskovitz/TOP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tedmoskovitz/TOP)

tedmoskovitz / TOP

Implementation of Tactical Optimistic and Pessimistic value estimation

☆25

Alternatives and similar repositories for TOP

Users that are interested in TOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

philipjball / TD3_PyTorch
View on GitHub
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10Jun 20, 2021Updated 5 years ago
tianjunz / NovelD
View on GitHub
☆40Nov 23, 2021Updated 4 years ago
yfletberliac / adversarially-guided-actor-critic
View on GitHub
AGAC: Adversarially Guided Actor-Critic
☆47Sep 16, 2021Updated 4 years ago
facebookresearch / e3b
View on GitHub
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
☆87Mar 22, 2024Updated 2 years ago
microsoft / oac-explore
View on GitHub
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70Aug 11, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
facebookresearch / impact-driven-exploration
View on GitHub
impact-driven-exploration
☆136Oct 3, 2023Updated 2 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
sfujim / SR-DICE
View on GitHub
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28Dec 7, 2021Updated 4 years ago
philipjball / ReadyPolicyOne
View on GitHub
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Jul 6, 2023Updated 3 years ago
holarissun / RewardShifting
View on GitHub
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
aronsar / hoad
View on GitHub
☆14Jun 17, 2022Updated 4 years ago
microsoft / lightATAC
View on GitHub
A lightweight reimplementation of Adversarially Trained Actor Critic
☆19Mar 19, 2026Updated 4 months ago
boschresearch / ube-mbrl
View on GitHub
Model-Based Uncertainty in Value Functions (AISTATS2023)
☆16Feb 28, 2023Updated 3 years ago
microsoft / ATAC
View on GitHub
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74Feb 2, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hmishfaq / LSAC
View on GitHub
The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025
☆22May 28, 2025Updated last year
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
tohsin / Safe-panda-gym
View on GitHub
OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.
☆12Sep 8, 2023Updated 2 years ago
sparisi / cbet
View on GitHub
Change-Based Exploration Transfer
☆35Apr 24, 2022Updated 4 years ago
google-research / pisac
View on GitHub
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
☆45Jun 8, 2023Updated 3 years ago
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
xingchenwan / bgpbt
View on GitHub
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆31Sep 16, 2022Updated 3 years ago
JasonMa2016 / CODAC
View on GitHub
Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
☆22Aug 1, 2021Updated 4 years ago
sahandrez / homomorphic_policy_gradient
View on GitHub
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24Apr 8, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
samlobel / CFN
View on GitHub
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆25Dec 29, 2023Updated 2 years ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago
LXXXXR / ICES
View on GitHub
[ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…
☆25May 29, 2024Updated 2 years ago
sfujim / LAP-PAL
View on GitHub
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆41Dec 7, 2021Updated 4 years ago
Lifelong-ML / offline-compositional-rl-datasets
View on GitHub
☆21Mar 19, 2024Updated 2 years ago
tigerneil / reinforcementlearning.today
View on GitHub
Made for a reading group at the Center for Safe AGI.
☆12Feb 23, 2026Updated 4 months ago
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
baturaysaglam / actor-prioritized-exp-replay
View on GitHub
Actor Prioritized Experience Replay
☆19Nov 20, 2023Updated 2 years ago
StanfordASL / BaRC
View on GitHub
Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…
☆12Jun 20, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RajGhugare19 / VE-principle-for-model-based-RL
View on GitHub
Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…
☆18Apr 13, 2021Updated 5 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
behaviorguidedRL / BGRL
View on GitHub
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Jun 24, 2020Updated 6 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
AlgTUDelft / WCSAC
View on GitHub
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
☆67Aug 3, 2023Updated 2 years ago
KyriacosShiarli / taco
View on GitHub
☆25Jan 2, 2019Updated 7 years ago
IndustAI / risk-and-uncertainty
View on GitHub
Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"
☆11Oct 3, 2023Updated 2 years ago