apexrl / QSnakeGameLinks
Snake game RL environment for Ubiquant competition 2022.
☆22Updated 3 years ago
Alternatives and similar repositories for QSnakeGame
Users that are interested in QSnakeGame are comparing it to the libraries listed below
Sorting:
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- Re-implementations of SOTA RL algorithms.☆135Updated 2 years ago
- ☆12Updated last year
- ☆30Updated 3 years ago
- A beamer template for LAMDA lab at NJU☆17Updated 5 years ago
- ☆315Updated 3 years ago
- Benchmarked implementations of Offline RL Algorithms.☆74Updated 7 months ago
- ☆90Updated 3 years ago
- ☆17Updated last year
- Paper Collection for Batch RL with brief introductions.☆85Updated 3 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Updated 3 years ago
- Paper Collection for Imitation Learning in RL.☆149Updated 3 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Updated 5 years ago
- CS285课程笔记☆22Updated 5 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆43Updated 3 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL).☆51Updated 5 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 3 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Updated 2 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆174Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆24Updated last year
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Updated 5 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- [NeurIPS 2023] Efficient Diffusion Policy☆110Updated last year
- Financial Big Data and Quantitative Analytics, Spring 2021.☆91Updated 4 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022).☆19Updated 3 years ago
- ☆126Updated 4 years ago
- ☆287Updated 3 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆71Updated 2 years ago
- ☆43Updated 4 years ago