dbsxdbsx / rl-intro-book-chineseView external linksLinks
Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition
☆126Apr 15, 2019Updated 6 years ago
Alternatives and similar repositories for rl-intro-book-chinese
Users that are interested in rl-intro-book-chinese are comparing it to the libraries listed below
Sorting:
- A translation of Reinforcement Learning: An Introduction☆114Aug 20, 2018Updated 7 years ago
- A Q & A system based on Chinese wikipedia knowledge☆19May 26, 2017Updated 8 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- discrete gate sizing☆14Nov 23, 2020Updated 5 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- Keras (re)implementation of paper "Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks. SIGIR, 2015"☆67Dec 1, 2016Updated 9 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- Python Implementation of Reinforcement Learning: An Introduction☆14,560Aug 9, 2024Updated last year
- 中文整理的强化学习资料(Reinforcement Learning)☆2,141Apr 30, 2020Updated 5 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- (TPAMI) Human-guided Reinforcement Learning with Sim-to-real Transfer for Autonomous Navigation☆25Sep 18, 2023Updated 2 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆269Dec 4, 2018Updated 7 years ago
- ☆20Mar 1, 2019Updated 6 years ago
- Official code for Generative Marginalization Models [ICML 2024] [SPGIM 2023 Workshop Oral]☆24Aug 19, 2024Updated last year
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- Keras implementation of ABCNN by Yin & Schütze (WIP)☆23Jun 16, 2020Updated 5 years ago
- SeqGAN but with more bells and whistles☆24Feb 15, 2018Updated 8 years ago
- ☆25Aug 2, 2024Updated last year
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- Implementation of Deepmind's Neural Episodic Control☆58May 9, 2018Updated 7 years ago
- Neuronales Netz zur Trajektorienprädiktion von Fahrzeugen mit OnlineLearning☆34Mar 2, 2023Updated 2 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Apr 14, 2021Updated 4 years ago
- ☆30Feb 20, 2021Updated 4 years ago
- ☆28Apr 30, 2019Updated 6 years ago
- A pyTorch implementation of models used for Recognizing Textual Entailment using the SNLI corpus☆33Aug 12, 2017Updated 8 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- MLCAD 2020: Reinforcement for logic optimization sequence exploration☆29Oct 17, 2020Updated 5 years ago
- ☆28Apr 28, 2019Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆80Jan 19, 2019Updated 7 years ago
- ☆33Nov 21, 2022Updated 3 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- Tree-Invent: A novel molecular generative model constrained with topological tree☆13Jul 26, 2023Updated 2 years ago
- ☆11Dec 23, 2024Updated last year
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- Software and hardware to test VR latency on any device☆10May 1, 2018Updated 7 years ago
- Code for hierarchical imitation learning and reinforcement learning☆301Mar 14, 2018Updated 7 years ago