ReinholdM/Papers-of-Offline-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ReinholdM/Papers-of-Offline-RL)

ReinholdM / Papers-of-Offline-RL

Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)

☆19

Alternatives and similar repositories for Papers-of-Offline-RL

Users that are interested in Papers-of-Offline-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
YiqinYang / ICQ
View on GitHub
Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…
☆76Oct 18, 2022Updated 3 years ago
albertwilcox / mcac
View on GitHub
Author implementation of Monte Carlo Augmented Actor Critic in PyTorch
☆18Oct 24, 2022Updated 3 years ago
princeton-nlp / SRL-NLC
View on GitHub
Safe Reinforcement Learning with Natural Language Constraints
☆17Oct 24, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Sheepsody / Batched-Impala-PyTorch
View on GitHub
Reinforcement learning - Batched Impala - PyTorch - Mario Kart
☆13Jul 21, 2020Updated 6 years ago
microsoft / HuRL
View on GitHub
Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper
☆17Jan 3, 2022Updated 4 years ago
HxLyn3 / MPPVE
View on GitHub
☆10Sep 19, 2023Updated 2 years ago
hcmlab / GANterfactual-RL
View on GitHub
Counterfactual explanations for Reinforcement Learning agents on Atari
☆12Apr 3, 2023Updated 3 years ago
seungeunrho / football-paris
View on GitHub
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141
☆57Dec 14, 2020Updated 5 years ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
2019ChenGong / Offline_RL_Poisoner
View on GitHub
[S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".
☆33Dec 30, 2024Updated last year
Bluedotdot2021 / PRML-book_review
View on GitHub
PRML Page-by-page配套资料，对PRML全书及各章节的review
☆17Apr 16, 2024Updated 2 years ago
atavakol / action-hypergraph-networks
View on GitHub
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Kaixhin / GUDRL
View on GitHub
Generalised UDRL
☆37May 12, 2022Updated 4 years ago
thu-ml / CEP-energy-guided-diffusion
View on GitHub
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction
☆35Nov 3, 2023Updated 2 years ago
feidieufo / homework
View on GitHub
Assignments for CS294-112.
☆30Sep 11, 2019Updated 6 years ago
HzcIrving / DecisionTransformer_StepbyStep
View on GitHub
Decision Transformer: A brand new Offline RL Pattern.
☆38Jan 28, 2022Updated 4 years ago
zhc134 / tlc-baselines
View on GitHub
☆27Apr 24, 2020Updated 6 years ago
HumanCompatibleAI / overcooked-demo
View on GitHub
Web application where humans can play Overcooked with AI agents.
☆60Dec 6, 2022Updated 3 years ago
beanie00 / Decision-ConvFormer
View on GitHub
[ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"
☆12Apr 22, 2024Updated 2 years ago
YangRui2015 / RORL
View on GitHub
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆24Feb 15, 2023Updated 3 years ago
BY571 / Implicit-Q-Learning
View on GitHub
PyTorch implementation of the implicit Q-learning algorithm (IQL)
☆44Dec 17, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
google-deepmind / constrained_optidice
View on GitHub
☆10Sep 9, 2022Updated 3 years ago
facebookresearch / icp-block-mdp
View on GitHub
Invariant Causal Prediction for Block MDPs
☆44Jun 11, 2020Updated 6 years ago
weiaiF / offlineRL-INTERACTION
View on GitHub
☆18Sep 23, 2022Updated 3 years ago
BinYang24 / Reinforcement-Learning-Pytorch
View on GitHub
☆12Feb 20, 2021Updated 5 years ago
erikbr01 / octo_experiments
View on GitHub
Setup for Octo and some experiments with the model
☆12Apr 11, 2024Updated 2 years ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
WeiXiongUST / Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning
View on GitHub
This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…
☆32Dec 5, 2024Updated last year
YuhangSong / Arena-Baselines
View on GitHub
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103Mar 6, 2025Updated last year
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
deligentfool / SIDE
View on GitHub
Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"
☆11Jun 24, 2022Updated 4 years ago
GIS-PuppetMaster / Auto-STGCN
View on GitHub
source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…
☆11Jan 26, 2021Updated 5 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
dmksjfl / SEABO
View on GitHub
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12Jan 19, 2024Updated 2 years ago
takuseno / d4rl-pybullet
View on GitHub
Datasets for data-driven deep reinforcement learning with PyBullet environments
☆152Mar 19, 2021Updated 5 years ago
NeteaseFuxiRL / FeverBasketball
View on GitHub
The open source of FeverBasketball environment for research purpose.
☆11Mar 2, 2020Updated 6 years ago
renweiya / RFQ-RFAC
View on GitHub
Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning
☆17Mar 11, 2020Updated 6 years ago