apexrl / QSnakeGameLinks
Snake game RL environment for Ubiquant competition 2022.
☆22Updated 3 years ago
Alternatives and similar repositories for QSnakeGame
Users that are interested in QSnakeGame are comparing it to the libraries listed below
Sorting:
- RLA is a tool for managing your RL experiments automatically☆72Updated 3 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- ☆12Updated last year
- ☆30Updated 3 years ago
- ☆13Updated 2 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 6 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Updated 3 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆72Updated 3 years ago
- ☆73Updated 2 years ago
- ☆15Updated 5 years ago
- ☆44Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆47Updated 2 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆123Updated 2 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆76Updated 11 months ago
- Re-implementations of SOTA RL algorithms.☆136Updated 2 years ago
- ☆25Updated 3 years ago
- A pack of reinforcement learning algorithms.☆84Updated 4 years ago
- ☆12Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Updated 2 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Updated 6 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Updated 6 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL).☆54Updated 5 years ago
- ☆33Updated last year
- ☆31Updated 3 years ago