Personal Repo to keep track of RL papers
☆31 · May 3, 2021 · Updated 4 years ago
Alternatives and similar repositories for RLPaperList
Users that are interested in RLPaperList are comparing it to the libraries listed below.
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019) ☆20 · Nov 26, 2020 · Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained) ☆24 · Mar 27, 2020 · Updated 6 years ago
- ☆92 · Dec 5, 2023 · Updated 2 years ago
- Meta-Reinforcement Learning with Policy Residual Representation ☆11 · Aug 15, 2019 · Updated 6 years ago
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch. ☆11 · Feb 6, 2023 · Updated 3 years ago
- ACL19_Depth_Growing_for_Neural_Machine_Translation ☆23 · Jul 6, 2019 · Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Jan 10, 2019 · Updated 7 years ago
- ☆15 · Sep 25, 2019 · Updated 6 years ago
- Reinforcement Learning papers on exploration methods. ☆19 · Jun 27, 2021 · Updated 4 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization" ☆124 · Dec 28, 2020 · Updated 5 years ago
- ☆13 · Mar 16, 2023 · Updated 3 years ago
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d… ☆11 · Nov 20, 2018 · Updated 7 years ago
- ☆399 · Jul 18, 2019 · Updated 6 years ago
- Anti exploration in offline reinforcement learning ☆11 · May 17, 2021 · Updated 4 years ago
- Robust policy search algorithms which train on model ensembles ☆31 · Oct 26, 2016 · Updated 9 years ago
- Cat Detection and Breed Recognition ☆16 · Oct 27, 2018 · Updated 7 years ago
- ☆62 · Jun 22, 2018 · Updated 7 years ago
- Implementation of Random Expert Distillation ☆29 · May 11, 2019 · Updated 6 years ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION ☆25 · Jan 26, 2026 · Updated 2 months ago
- Geometry generation/planning for robotically assembled spatial structures ☆14 · Mar 23, 2023 · Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Sep 13, 2019 · Updated 6 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020) ☆12 · Jul 7, 2021 · Updated 4 years ago
- A dependency free library of standardized optimization test functions written in pure Python. ☆62 · Dec 15, 2025 · Updated 3 months ago
- ☆11 · Oct 26, 2022 · Updated 3 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆15 · Jun 28, 2025 · Updated 9 months ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning ☆57 · Apr 3, 2018 · Updated 8 years ago
- Model-based Policy Gradients ☆32 · Mar 12, 2020 · Updated 6 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs. ☆14 · Nov 4, 2025 · Updated 5 months ago
- ☆12 · Sep 15, 2021 · Updated 4 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos… ☆16 · Oct 29, 2017 · Updated 8 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆35 · Jan 5, 2023 · Updated 3 years ago
- Learning to Incentivize Other Learning Agents ☆36 · Jun 13, 2022 · Updated 3 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ… ☆17 · Jun 25, 2019 · Updated 6 years ago
- Deep Q-Network (DQN) to play classic Atari Games ☆11 · Sep 18, 2017 · Updated 8 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆197 · Dec 8, 2022 · Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Nov 24, 2021 · Updated 4 years ago
- Code related to the paper "Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation" ☆13 · May 8, 2019 · Updated 6 years ago
- ☆10 · Oct 15, 2020 · Updated 5 years ago