dmksjfl/MCQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dmksjfl/MCQ)

dmksjfl / MCQ

Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)

☆64

Alternatives and similar repositories for MCQ

Users that are interested in MCQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
snu-mllab / EDAC
View on GitHub
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆80Aug 14, 2022Updated 3 years ago
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
secury / optidice
View on GitHub
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆16Aug 3, 2023Updated 2 years ago
YangRui2015 / RORL
View on GitHub
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆24Feb 15, 2023Updated 3 years ago
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 3 years ago
ryanxhr / DWBC
View on GitHub
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆37Jan 5, 2023Updated 3 years ago
ltlhuuu / A2PR
View on GitHub
[ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…
☆34May 31, 2024Updated 2 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
PKU-RL / PTGM
View on GitHub
[ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning
☆30Mar 1, 2024Updated 2 years ago
Facebear-ljx / PROTO
View on GitHub
☆17May 25, 2023Updated 3 years ago
dmksjfl / SEABO
View on GitHub
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12Jan 19, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
stanford-iris-lab / batch-exploration
View on GitHub
☆12Apr 25, 2022Updated 4 years ago
microsoft / lightATAC
View on GitHub
A lightweight reimplementation of Adversarially Trained Actor Critic
☆19Mar 19, 2026Updated 4 months ago
polixir / OfflineRL
View on GitHub
A collection of offline reinforcement learning algorithms.
☆211Nov 26, 2024Updated last year
davidbrandfonbrener / onestep-rl
View on GitHub
☆44Sep 19, 2021Updated 4 years ago
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
Div-Infinity / XQL
View on GitHub
Extreme Q-Learning: Max Entropy RL without Entropy
☆88Feb 14, 2023Updated 3 years ago
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ReinholdM / Papers-of-Offline-RL
View on GitHub
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆19Apr 21, 2022Updated 4 years ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
ethanluoyc / optimal_transport_reward
View on GitHub
☆18Apr 11, 2024Updated 2 years ago
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
aviralkumar2907 / CQL
View on GitHub
Code for conservative Q-learning
☆486Dec 7, 2021Updated 4 years ago
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
Wenxuan-Zhou / PLAS
View on GitHub
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
☆54Oct 18, 2021Updated 4 years ago
Dragon-Zhuang / BPPO
View on GitHub
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆95Dec 13, 2023Updated 2 years ago
microsoft / ATAC
View on GitHub
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74Feb 2, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shlee94 / Off2OnRL
View on GitHub
☆61Feb 3, 2023Updated 3 years ago
thuml / SPOT
View on GitHub
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22Jun 24, 2023Updated 3 years ago
Facebear-ljx / DOGE
View on GitHub
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
☆44Mar 6, 2023Updated 3 years ago
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,694Nov 18, 2024Updated last year
takuseno / d3rlpy-benchmarks
View on GitHub
Benchmark data for d3rlpy
☆22Nov 28, 2023Updated 2 years ago
philippe-eecs / IDQL
View on GitHub
Repo for Implicit Diffusion Q-Learning
☆126Dec 5, 2023Updated 2 years ago
denisyarats / exorl
View on GitHub
ExORL: Exploratory Data for Offline Reinforcement Learning
☆138Feb 8, 2022Updated 4 years ago