zuoxingdong / gym-recsys
Customizable RecSys Simulator for OpenAI Gym
☆26 Updated 3 years ago
Alternatives and similar repositories for gym-recsys
Users who are interested in gym-recsys are comparing it to the libraries listed below.
- ☆50 Updated last year
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 5 years ago
- ☆89 Updated last year
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning". ☆59 Updated last year
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆99 Updated 3 years ago
- A2C is a special case of PPO! ☆22 Updated 3 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 Updated 5 years ago
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020) ☆43 Updated 2 years ago
- Toy environment set for multi-agent reinforcement learning and more ☆39 Updated 11 months ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms ☆38 Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 4 years ago
- Generalised UDRL ☆37 Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 Updated 2 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆32 Updated 6 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583 ☆19 Updated 6 years ago
- ☆68 Updated 2 years ago
- A toolkit of Reinforcement Learning based Recommendation (RL4Rec) ☆27 Updated 3 years ago
- Reinforcement learning algorithms in RLlib ☆59 Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 Updated 5 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices ☆23 Updated 4 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles. ☆26 Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆96 Updated 4 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI. ☆79 Updated 2 years ago
- Understanding RL vision Distill article ☆24 Updated 2 years ago
- Reinforcement Learning Assembly ☆92 Updated 4 years ago
- Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch). ☆90 Updated 6 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆37 Updated 5 years ago
- ☆31 Updated 6 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 4 years ago