zuoxingdong / gym-recsysLinks
Customizable RecSys Simulator for OpenAI Gym
☆26Updated 3 years ago
Alternatives and similar repositories for gym-recsys
Users that are interested in gym-recsys are comparing it to the libraries listed below
Sorting:
- ☆51Updated last year
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆99Updated 3 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆38Updated 2 years ago
- ☆88Updated last year
- ☆67Updated 2 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Updated 2 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆69Updated 4 years ago
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset☆57Updated 5 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"☆24Updated 2 years ago
- A toolkit of Reinforcement Learning based Recommendation (RL4Rec)☆26Updated 3 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- Understanding RL vision Distill article☆24Updated 2 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆21Updated 3 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.☆21Updated 4 years ago
- Implementation of variational autoencoders for collaborative filtering in PyTorch☆25Updated 6 years ago
- RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems☆122Updated 3 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- Thompson Sampling Tutorial☆54Updated 6 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆25Updated 5 years ago
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- Offline evaluation of multi-armed bandit algorithms☆23Updated 4 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated last year
- Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update☆75Updated 11 months ago
- ☆53Updated 5 years ago
- Bandit algorithms simulations for online learning☆88Updated 5 years ago