ReinholdM / Papers-of-Offline-RLView external linksLinks
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 3 years ago
Alternatives and similar repositories for Papers-of-Offline-RL
Users that are interested in Papers-of-Offline-RL are comparing it to the libraries listed below
Sorting:
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Oct 18, 2022Updated 3 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 3 years ago
- A peper list for machine learning models solving combinatorial problems, NP-hard problems and problems in graphs.☆14Aug 14, 2020Updated 5 years ago
- The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141☆56Dec 14, 2020Updated 5 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆23Feb 15, 2023Updated 3 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Jul 16, 2020Updated 5 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- It's the pytorch implementation of google research football.☆43Jun 14, 2019Updated 6 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆60Apr 29, 2024Updated last year
- Web application where humans can play Overcooked with AI agents.☆60Dec 6, 2022Updated 3 years ago
- [ICLR '21] Interpretable Neural Architecture Search using Bayesian Optimisation with Weisfiler-Lehman Kernel (NAS-BOWL)☆23Dec 27, 2021Updated 4 years ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆32Dec 5, 2024Updated last year
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"☆28Sep 2, 2022Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- Process Simulations Meet AI. Supercharge Your Process Engineering. Generate Infinite Data, Train Advanced Models, and Revolutionise Indus…☆11Oct 8, 2024Updated last year
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- [S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".☆32Dec 30, 2024Updated last year
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆30Oct 5, 2022Updated 3 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- [IROS2023]Learning to Solve Tasks with Exploring Prior Behaviours☆12Mar 3, 2024Updated last year
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Mar 19, 2021Updated 4 years ago
- PhD Thesis Template with Roboto Font and Color Sections☆11Jul 24, 2020Updated 5 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- Explains the conclusions of a logic program.☆10May 25, 2023Updated 2 years ago
- Deep Structure Learning for Fraud Detection (ICDM 2018)☆10Oct 2, 2020Updated 5 years ago
- Notes and samples for Python performance talk☆10Feb 17, 2022Updated 4 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆26Oct 16, 2025Updated 4 months ago
- ☆12Sep 24, 2018Updated 7 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago