UCB CS294-112 深度强化学习中文笔记
☆51Jan 2, 2021Updated 5 years ago
Alternatives and similar repositories for ucb-cs294-112-notes-zh
Users that are interested in ucb-cs294-112-notes-zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [译] 笨办法学 Linux 中文版☆16Dec 24, 2020Updated 5 years ago
- 斯坦福 cs234 强化学习中文讲义☆208Jan 2, 2021Updated 5 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))☆14Mar 22, 2019Updated 7 years ago
- The code has been implemented in Carla Simulator with the help of Double DQN to train an agent how to drive autonomously using different …☆16Aug 20, 2019Updated 6 years ago
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- Code for CIKM 2021 best short paper nomination "Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation" https…☆16Jun 11, 2021Updated 4 years ago
- [译] Java8 中文官方文档(施工中)☆43Sep 17, 2020Updated 5 years ago
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa☆10Jan 28, 2020Updated 6 years ago
- Review of Reinforcement Learning☆12Dec 27, 2018Updated 7 years ago
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10☆12Apr 6, 2022Updated 3 years ago
- PythonProgramming.net 系列教程☆11Mar 19, 2022Updated 4 years ago
- [译] Scikit-learn 秘籍☆54Sep 12, 2019Updated 6 years ago
- iBooker 老实人报☆16Apr 20, 2023Updated 2 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆22Aug 4, 2022Updated 3 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research☆19Aug 9, 2025Updated 7 months ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆12May 20, 2019Updated 6 years ago
- ☆10Feb 13, 2022Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- deploy machine learning model in tensorflow sering and docker☆10Dec 5, 2018Updated 7 years ago
- ☆54Jul 5, 2021Updated 4 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- This repo has scripts to compare various powerful RL methods☆39Feb 23, 2026Updated last month
- Tensorflow tf.metrics tutorial☆12Aug 30, 2018Updated 7 years ago
- Numba 0.44 中文文档☆34Aug 2, 2023Updated 2 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆10Jan 18, 2025Updated last year
- ☆25Jan 18, 2025Updated last year
- Deep Visual MPC-Policy Learning for Navigation☆30May 19, 2022Updated 3 years ago
- ADP☆12Apr 12, 2017Updated 8 years ago
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo.☆10Feb 11, 2022Updated 4 years ago
- Easily manage multiple sessions with telescope integration.☆14Sep 28, 2023Updated 2 years ago
- Multi-robot Reinforcement Learning Scalable Training School (MRST) is a training and evaluation platform for reinforcement learning rease…☆11Sep 6, 2022Updated 3 years ago
- ☆13Apr 28, 2021Updated 4 years ago
- Distributed Multi-Object Tracking Under Limited Field of View Sensors.☆20Oct 8, 2021Updated 4 years ago
- A simple camera board using GMAX3412 1" 4K@30fps global shutter sensor☆20Dec 21, 2025Updated 3 months ago
- Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i…☆12Oct 12, 2024Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- ☆16May 31, 2024Updated last year
- Reinforcement Learning from Hierarchical Critics☆13Jul 30, 2020Updated 5 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago