apexrl / QSnakeGame
Snake game RL environment for Ubiquant competition 2022.
☆22Updated 3 years ago
Alternatives and similar repositories for QSnakeGame
Users who are interested in QSnakeGame are comparing it to the libraries listed below.
- A Python module designed for agile RL algorithm development.☆26Updated last year
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- Re-implementations of SOTA RL algorithms.☆135Updated 2 years ago
- ☆12Updated last year
- ☆314Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- ☆90Updated 3 years ago
- ☆30Updated 3 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Updated 2 months ago
- ☆73Updated last year
- A beamer template for LAMDA lab at NJU☆16Updated 4 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL).☆50Updated 5 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- CS285 course notes☆22Updated 5 years ago
- Learning-based agent for Google Research Football (football game agent)☆121Updated 2 years ago
- ☆12Updated 3 years ago
- ☆25Updated 3 years ago
- The code of paper Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. Zhihai Wang, Jie Wang*, Qi Zhou, Bin…☆20Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- ☆124Updated 4 years ago
- Official PyTorch Implementation of CMLO in the paper "When to Update Your Model: Constrained Model-based Reinforcement Learning"☆10Updated last year
- Financial Big Data and Quantitative Analytics, Spring 2021.☆91Updated 4 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluate a large set of candidate policies from a fixed dataset to select the best ones.☆11Updated 3 years ago
- ☆169Updated last year
- Implements the PPO algorithm on MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2.☆53Updated 5 years ago
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆30Updated 11 months ago
- ☆50Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- Code for the paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"☆17Updated last year