Personal Repo to keep track of RL papers
☆31May 3, 2021Updated 4 years ago
Alternatives and similar repositories for RLPaperList
Users that are interested in RLPaperList are comparing it to the libraries listed below
Sorting:
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆24Mar 27, 2020Updated 5 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.☆11Feb 6, 2023Updated 3 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- ☆15Sep 25, 2019Updated 6 years ago
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- ☆13Mar 16, 2023Updated 3 years ago
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- ☆399Jul 18, 2019Updated 6 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- ☆62Jun 22, 2018Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆30Oct 26, 2016Updated 9 years ago
- Cat Detection and Breed Recognition☆16Oct 27, 2018Updated 7 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Feb 5, 2019Updated 7 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Geometry generation/planning for robotically assembled spatial structures☆14Mar 23, 2023Updated 2 years ago
- 在Kaggle比赛 Home Credit Default Risk中测试gplearn进行特征工程的效果☆10Jul 18, 2018Updated 7 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- A dependency free library of standardized optimization test functions written in pure Python.☆62Dec 15, 2025Updated 3 months ago
- ☆10Oct 26, 2022Updated 3 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 7 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 4 months ago
- ☆12Sep 15, 2021Updated 4 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos…☆16Oct 29, 2017Updated 8 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Learning to Incentivize Other Learning Agents☆36Jun 13, 2022Updated 3 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆17Jun 25, 2019Updated 6 years ago
- Deep Q-Network (DQN) to play classic Atari Games☆11Sep 18, 2017Updated 8 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆198Dec 8, 2022Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago