rlchina / rlchina.github.io

☆10

Related projects: ⓘ

tjuHaoXiaotian / ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
☆27Updated 4 years ago
yudasong / Reinforcement-Learning-Branch-and-Bound
☆16Updated 6 years ago
water-mirror / DPR
Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…
☆18Updated 4 years ago
maybeluo / KDDCup2020-RL-1st-solution
1st solution for KDD Cup 2020 (RL track)
☆57Updated 4 years ago
eyounx / PRR
Meta-Reinforcement Learning with Policy Residual Representation
☆11Updated 5 years ago
venktesh22 / ExpressLanes_Deep-RL
☆24Updated 4 years ago
chaovven / maab
Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022
☆19Updated 2 years ago
jingw2 / neural-combinatorial-optimization-rl
☆35Updated 4 years ago
ma-aouad / DynamicMNL
☆10Updated 3 years ago
mktal / kddcup-starting-kit
The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.
☆84Updated 3 years ago
CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆11Updated 6 years ago
liber145 / rlpack
A pack of reinforcement learning algorithms.
☆80Updated 2 years ago
xwhan / walk_the_blocks
Implementation of Scheduled Policy Optimization for task-oriented language grouding
☆29Updated 6 years ago
JayMan91 / aaai_predit_then_optimize
Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"
☆35Updated 2 months ago
henryslzhao / RL4Recsys
paper list in the area of reinforcenment learning for recommendation systems
☆24Updated 4 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 4 years ago
Jinjiarui / rl4rs-papers
A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.
☆68Updated 4 years ago
venkatacrc / Budget_Constrained_Bidding
Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning
☆39Updated 4 years ago
qiang-ma / HRL-for-combinatorial-optimization
Hierarchical deep reinforcement learning for combinatorial optimization problem
☆33Updated 4 years ago
cxxgtxy / POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44Updated 5 years ago
KaiYan289 / RLpapersnote
☆38Updated this week
laxatives / rl
Illustrated Examples from Sutton and Barto
☆35Updated last year
PKU-RL / FEN
FEN Code
☆36Updated 4 years ago
lightaime / TensorAgent
Deep reinforcement learning agents implement by tensorflow https://ghli.org
☆54Updated 5 years ago
Lunj12 / RL-Bandits-with-Knapsacks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆28Updated 5 years ago
bwilder0 / aaai_melding_code
Code the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"
☆27Updated 3 years ago
ying-wen / malib_deprecated
A Multi-agent Learning Framework
☆61Updated 3 years ago
liangxinedu / stepwise
☆20Updated 3 years ago
hzn666 / RLBid_EA
This is a repository of the experiment code supporting the paper Real-time Bidding Strategy in Display Advertising: An Empirical Analysis…
☆24Updated last year
cmusjtuliuyuan / RainBow
RainBow, Tensorflow
☆49Updated 6 years ago