Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 3 years ago
Alternatives and similar repositories for Papers-of-Offline-RL
Users that are interested in Papers-of-Offline-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Oct 18, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141☆57Dec 14, 2020Updated 5 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 2 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- PRML Page-by-page配套资料,对PRML全书及各章节的review☆17Apr 16, 2024Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆61Apr 29, 2024Updated last year
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆35Nov 3, 2023Updated 2 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Jul 16, 2020Updated 5 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆37Jan 28, 2022Updated 4 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated last year
- ☆26Apr 24, 2020Updated 5 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)☆12Feb 16, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Invariant Causal Prediction for Block MDPs☆44Jun 11, 2020Updated 5 years ago
- Simulation of car parking in different parking lots using Unity ML-Agents☆12Dec 16, 2023Updated 2 years ago
- ☆12Feb 20, 2021Updated 5 years ago
- ☆17Sep 23, 2022Updated 3 years ago
- Setup for Octo and some experiments with the model☆12Apr 11, 2024Updated last year
- ☆15Jun 1, 2023Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Mar 6, 2025Updated last year
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 9 months ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆32Dec 5, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆14Jul 4, 2022Updated 3 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Mar 19, 2021Updated 5 years ago
- The open source of FeverBasketball environment for research purpose.☆11Mar 2, 2020Updated 6 years ago
- This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL☆12Nov 5, 2021Updated 4 years ago