Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 3 years ago
Alternatives and similar repositories for Papers-of-Offline-RL
Users that are interested in Papers-of-Offline-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Oct 18, 2022Updated 3 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆25Feb 15, 2023Updated 3 years ago
- The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141☆57Dec 14, 2020Updated 5 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 3 years ago
- [S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".☆33Dec 30, 2024Updated last year
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- PRML Page-by-page配套资料,对PRML全书及各章节的review☆17Apr 16, 2024Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆61Apr 29, 2024Updated last year
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- ☆10Sep 19, 2023Updated 2 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆35Nov 3, 2023Updated 2 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Jul 16, 2020Updated 5 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆38Jan 28, 2022Updated 4 years ago
- Web application where humans can play Overcooked with AI agents.☆60Dec 6, 2022Updated 3 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆10Sep 9, 2022Updated 3 years ago
- Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)☆12Feb 16, 2023Updated 3 years ago
- Simulation of car parking in different parking lots using Unity ML-Agents☆12Dec 16, 2023Updated 2 years ago
- ☆17Sep 23, 2022Updated 3 years ago
- Setup for Octo and some experiments with the model☆12Apr 11, 2024Updated 2 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Mar 6, 2025Updated last year
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 10 months ago
- ☆14Jul 4, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Mar 19, 2021Updated 5 years ago
- The open source of FeverBasketball environment for research purpose.☆11Mar 2, 2020Updated 6 years ago
- This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL☆13Nov 5, 2021Updated 4 years ago