Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition
☆125Apr 15, 2019Updated 6 years ago
Alternatives and similar repositories for rl-intro-book-chinese
Users that are interested in rl-intro-book-chinese are comparing it to the libraries listed below
Sorting:
- A translation of Reinforcement Learning: An Introduction☆114Aug 20, 2018Updated 7 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆655Apr 9, 2022Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Implementation of SNAIL(A Simple Neural Attentive Meta-Learner) with Gluon☆12Feb 22, 2019Updated 7 years ago
- discrete gate sizing☆14Nov 23, 2020Updated 5 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- Python Implementation of Reinforcement Learning: An Introduction☆14,577Aug 9, 2024Updated last year
- 中文整理的强化学习资料(Reinforcement Learning)☆2,145Apr 30, 2020Updated 5 years ago
- (TPAMI) Human-guided Reinforcement Learning with Sim-to-real Transfer for Autonomous Navigation☆25Sep 18, 2023Updated 2 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆269Dec 4, 2018Updated 7 years ago
- ☆20Mar 1, 2019Updated 7 years ago
- Official code for Generative Marginalization Models [ICML 2024] [SPGIM 2023 Workshop Oral]☆24Aug 19, 2024Updated last year
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- SeqGAN but with more bells and whistles☆24Feb 15, 2018Updated 8 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- A Fast and Open Source Autonomous Perception System.☆26Nov 23, 2022Updated 3 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- Implementation of Deepmind's Neural Episodic Control☆58May 9, 2018Updated 7 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- Neuronales Netz zur Trajektorienprädiktion von Fahrzeugen mit OnlineLearning☆34Mar 2, 2023Updated 3 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Apr 14, 2021Updated 4 years ago
- ☆28Apr 30, 2019Updated 6 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- ☆31Feb 20, 2021Updated 5 years ago
- MLCAD 2020: Reinforcement for logic optimization sequence exploration☆29Oct 17, 2020Updated 5 years ago
- ☆28Apr 28, 2019Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆81Jan 19, 2019Updated 7 years ago
- ☆33Nov 21, 2022Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Oct 18, 2022Updated 3 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone☆2,559Apr 11, 2022Updated 3 years ago
- ☆11Dec 23, 2024Updated last year
- Software and hardware to test VR latency on any device☆10May 1, 2018Updated 7 years ago
- Code for hierarchical imitation learning and reinforcement learning☆301Mar 14, 2018Updated 7 years ago
- ☆213Apr 11, 2017Updated 8 years ago
- Visual Navigation with Spatial Attention☆37Jan 2, 2025Updated last year