apexrl / QSnakeGame
Snake game RL environment for Ubiquant competition 2022.
☆21Updated 2 years ago
Alternatives and similar repositories for QSnakeGame
Users that are interested in QSnakeGame are comparing it to the libraries listed below
Sorting:
- A python module designed for agile RL algorithm developing.☆26Updated 10 months ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 2 years ago
- ☆11Updated last year
- ☆17Updated last year
- ☆29Updated 3 years ago
- Financial Big Data and Quantitative Analytics, Spring 2021.☆93Updated 3 years ago
- CS285课程笔记☆21Updated 5 years ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- ☆15Updated 4 years ago
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆51Updated 4 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 3 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Updated 2 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- ☆12Updated 2 years ago
- Re-implementations of SOTA RL algorithms.☆132Updated last year
- Code for Transfering Hierarchical Structure with Dual Meta Imitation Learning.☆15Updated 3 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆19Updated 3 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆18Updated 2 years ago
- A beamer template for LAMDA lab at NJU☆14Updated 4 years ago
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Updated 2 years ago
- A simple framework for distributed reinforcement learning in PyTorch.☆16Updated 5 years ago
- ☆46Updated 2 years ago
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆25Updated 7 months ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆24Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- ☆42Updated 3 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆10Updated 2 years ago
- official implementation of ODICE☆18Updated last year
- ☆124Updated 3 years ago