Personal Repo to keep track of RL papers
☆31May 3, 2021Updated 5 years ago
Alternatives and similar repositories for RLPaperList
Users that are interested in RLPaperList are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆24Mar 27, 2020Updated 6 years ago
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"☆12Jan 17, 2023Updated 3 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.☆11Feb 6, 2023Updated 3 years ago
- ACL19_Depth_Growing_for_Neural_Machine_Translation☆23Jul 6, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆16Sep 25, 2019Updated 6 years ago
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆124Dec 28, 2020Updated 5 years ago
- Efficient Exploration via State Marginal Matching (2019)☆70Jun 30, 2019Updated 6 years ago
- ☆13Mar 16, 2023Updated 3 years ago
- ☆399Jul 18, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago
- Cat Detection and Breed Recognition☆16Oct 27, 2018Updated 7 years ago
- ☆62Jun 22, 2018Updated 7 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 7 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Feb 5, 2019Updated 7 years ago
- 在Kaggle比赛 Home Credit Default Risk中测试gplearn进行特征工程的效果☆10Jul 18, 2018Updated 7 years ago
- Geometry generation/planning for robotically assembled spatial structures☆14Mar 23, 2023Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The project codes up a three hidden layer deep auto encoder, trained in a greedy layerwise fashion for initializing a corresponding deep …☆11Mar 19, 2017Updated 9 years ago
- A dependency free library of standardized optimization test functions written in pure Python.☆62Dec 15, 2025Updated 5 months ago
- ☆11Oct 26, 2022Updated 3 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 11 months ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 8 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 7 months ago
- ☆13Sep 15, 2021Updated 4 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos…☆16Oct 29, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Learning to Incentivize Other Learning Agents☆36Jun 13, 2022Updated 3 years ago
- Deep Q-Network (DQN) to play classic Atari Games☆11Sep 18, 2017Updated 8 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Code related to the paper "Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation"☆13May 8, 2019Updated 7 years ago
- Using Natural Language for Reward Shaping in Reinforcement Learning☆24Dec 11, 2023Updated 2 years ago
- ☆10Oct 15, 2020Updated 5 years ago