apexrl / QSnakeGame
Snake game RL environment for Ubiquant competition 2022.
☆21 Updated 3 years ago
Alternatives and similar repositories for QSnakeGame
Users that are interested in QSnakeGame are comparing it to the libraries listed below
- A Python module designed for agile RL algorithm development. ☆26 Updated 11 months ago
- RLA is a tool for managing your RL experiments automatically ☆71 Updated 2 years ago
- Financial Big Data and Quantitative Analytics, Spring 2021. ☆93 Updated 3 years ago
- ☆11 Updated last year
- ☆17 Updated last year
- ☆29 Updated 3 years ago
- CS285 course notes ☆21 Updated 5 years ago
- A beamer template for LAMDA lab at NJU ☆14 Updated 4 years ago
- ☆12 Updated 3 years ago
- A pack of reinforcement learning algorithms. ☆84 Updated 3 years ago
- ☆15 Updated 4 years ago
- Re-implementations of SOTA RL algorithms. ☆133 Updated last year
- SJTU thesis template version 2021, modified from the official version ☆46 Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆25 Updated last year
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization> ☆23 Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆42 Updated 2 years ago
- ☆25 Updated 2 years ago
- ☆13 Updated 2 years ago
- GPU cluster kubernetes configurations and usages ☆34 Updated 3 years ago
- References for factor model ☆35 Updated 4 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆44 Updated last year
- PyTorch implementation of the implicit Q-learning algorithm (IQL) ☆41 Updated 3 years ago
- ☆32 Updated 9 months ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination" ☆19 Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆67 Updated 2 years ago
- Paper Collection for Batch RL with brief introductions. ☆84 Updated 3 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆19 Updated 3 years ago
- Implementation of Deep Reinforcement Learning Benchmark Algorithms, including DQN, Double DQN, Dueling DQN, Reinforce, Actor-Critic, A2C,… ☆17 Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 Updated 2 years ago