rlchina / rlchina.github.io
☆10Updated 4 years ago
Related projects:
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆27Updated 4 years ago
- ☆16Updated 6 years ago
- Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…☆18Updated 4 years ago
- 1st solution for KDD Cup 2020 (RL track)☆57Updated 4 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Updated 5 years ago
- ☆24Updated 4 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022☆19Updated 2 years ago
- ☆35Updated 4 years ago
- ☆10Updated 3 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.☆84Updated 3 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆11Updated 6 years ago
- A pack of reinforcement learning algorithms.☆80Updated 2 years ago
- Implementation of Scheduled Policy Optimization for task-oriented language grounding☆29Updated 6 years ago
- Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"☆35Updated 2 months ago
- Paper list in the area of reinforcement learning for recommendation systems☆24Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 4 years ago
- A collection of research and survey papers on reinforcement learning (RL) based recommender system techniques.☆68Updated 4 years ago
- Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning☆39Updated 4 years ago
- Hierarchical deep reinforcement learning for combinatorial optimization problem☆33Updated 4 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 5 years ago
- ☆38Updated this week
- Illustrated Examples from Sutton and Barto☆35Updated last year
- FEN Code☆36Updated 4 years ago
- Deep reinforcement learning agents implemented in TensorFlow https://ghli.org☆54Updated 5 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆28Updated 5 years ago
- Code for the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"☆27Updated 3 years ago
- A Multi-agent Learning Framework☆61Updated 3 years ago
- ☆20Updated 3 years ago
- This is a repository of the experiment code supporting the paper Real-time Bidding Strategy in Display Advertising: An Empirical Analysis…☆24Updated last year
- RainBow, Tensorflow☆49Updated 6 years ago