Series of deep reinforcement learning algorithms 🤖
☆29Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for rl_lib
Users that are interested in rl_lib are comparing it to the libraries listed below
Sorting:
- ☆13Aug 9, 2022Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 6 years ago
- Online Service Function Chain Deployment for Live-Video virtualized Content Delivery Networks, a Deep Reinforcement Learning approach pap…☆10Nov 8, 2021Updated 4 years ago
- ☆24Jan 12, 2021Updated 5 years ago
- (NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning☆20Apr 8, 2019Updated 6 years ago
- ☆14Apr 8, 2021Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Jan 12, 2019Updated 7 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆32Apr 7, 2021Updated 4 years ago
- [ICLR 2024]: Is Self-Repair a Silver Bullet for Code Generation?☆15May 2, 2024Updated last year
- Propose & vote on reading group papers in the "Discussions" tab.☆12Feb 20, 2024Updated 2 years ago
- Regular Expression Builder for Python☆16May 26, 2021Updated 4 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- Proportional Navigation tool for python >=3.10☆20Jan 5, 2026Updated 2 months ago
- ☆12Feb 14, 2022Updated 4 years ago
- The RL discord wiki☆258Oct 20, 2020Updated 5 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- twenty lectures on algorithmic game theory☆10May 11, 2021Updated 4 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆80Oct 25, 2020Updated 5 years ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)☆14Feb 20, 2023Updated 3 years ago
- ☆12Dec 29, 2019Updated 6 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Apr 10, 2021Updated 4 years ago
- OS first assignment, UESTC, 电子科技大学, CS ,计科,操作系统☆11Jun 19, 2017Updated 8 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- The Panda Driver provides a series of components for initalising and controlling the Franka-Emika Panda robotic arm.☆16May 22, 2022Updated 3 years ago
- Official code for "Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Training" (CIKM'22).☆13Mar 8, 2024Updated 2 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Mar 24, 2023Updated 2 years ago
- Solution of KDD cup 2021☆11Jun 16, 2021Updated 4 years ago
- An official repository for a VAE tutorial of Probabilistic Modelling and Reasoning - a University of Edinburgh master's course.☆10Jan 2, 2024Updated 2 years ago
- A PyTorch implementation of the ACM SIGKDD 2021 paper titled "PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-base…☆17Dec 19, 2023Updated 2 years ago
- Cooperation and Fairness in Multi-Agent Reinforcement Learning☆16Aug 6, 2025Updated 7 months ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Mar 12, 2020Updated 6 years ago
- A thorough, straightforward, un-intimidating introduction to Gaussian processes in NumPy.☆16Jun 12, 2018Updated 7 years ago
- Twitter-NFT sales bot that tweets individual and sweep sales with images from Opensea, Looksrare, X2Y2, and Blur using Opensea/Looksrare …☆13Jul 27, 2023Updated 2 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Project on Causal Machine learning CS 7290☆16Dec 7, 2019Updated 6 years ago
- Cookiecutter PyTorch Lightning☆12Sep 7, 2021Updated 4 years ago
- Code and data of the CCS '22 paper titled "Understanding Security Issues in the NFT Ecosystem"☆11Dec 20, 2022Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago