Fengyuan-Shi / ADP_course_given_by_BERTSEKAS_2014_THU
Video and other material of ADP course given by BERTSEKAS at THU, 2014
☆56Updated 5 years ago
Alternatives and similar repositories for ADP_course_given_by_BERTSEKAS_2014_THU:
Users that are interested in ADP_course_given_by_BERTSEKAS_2014_THU are comparing it to the libraries listed below
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆58Updated 4 years ago
- ADP demo code for Reinforcement Learning and Control, Tsinghua Univ. Lecture Notes.☆40Updated 2 years ago
- A curated list of awesome video lectures and learning resources for operations analytics.☆29Updated 3 years ago
- Approximate Dynamic Programming exercises from Powell (2011)☆14Updated last year
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆57Updated 4 years ago
- Bilevel Optimization Algorithm☆39Updated 2 years ago
- ☆30Updated 6 years ago
- Nash Q Learning☆30Updated 4 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆41Updated 4 years ago
- Model-free policy gradient algorithm for LQR☆10Updated 4 years ago
- The repository archives papers regarding the combination of combinatorial optimization and machine learning and corresponding reading not…☆160Updated 4 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆70Updated last year
- ☆16Updated 6 years ago
- Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…☆18Updated 4 years ago
- ☆35Updated 4 years ago
- Paper list of multi-agent reinforcement learning (MARL)☆27Updated 3 years ago
- Code for paper publication: Deep reinforcement learning-based solution for a multi-objective online order batching problem☆11Updated 2 years ago
- Parameterizing Branch-and-Bound Search Trees to Learn Branching Policies (AAAI 2021)☆67Updated 3 years ago
- This repository includes a realization of the resilient projection-based consensus actor-critic algorithm that is resilient to adversaria…☆10Updated 2 years ago
- Code implementation for NeurIPS 2019 submission 'Reinforcement Learning for Integer Programming: Learning to Cut'☆35Updated 5 years ago
- ☆39Updated 2 months ago
- Transformer-based Multi-Agent Actor-Critic Framework☆44Updated 2 years ago
- Approximate dynamic programming (ADP) and Policy gradient (PG) based sequential optimal experimental design (sOED)☆19Updated 2 years ago
- ☆16Updated 4 years ago
- 运筹OR帷幄 强化学习学习小组☆24Updated 3 years ago
- Hierarchical deep reinforcement learning for combinatorial optimization problem☆35Updated 5 years ago
- Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"☆38Updated 7 months ago
- Implementation of point-based value iteration (for POMDPs)☆12Updated 4 years ago
- ☆20Updated 3 years ago
- Meta-Learning-based Deep Reinforcement Learning for Multiobjective Optimization Problems☆32Updated 8 months ago