UCB CS294-112 深度强化学习中文笔记
☆51Jan 2, 2021Updated 5 years ago
Alternatives and similar repositories for ucb-cs294-112-notes-zh
Users that are interested in ucb-cs294-112-notes-zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 斯坦福 cs234 强化学习中文讲义☆211Jan 2, 2021Updated 5 years ago
- [译] Python 机器学习在线指南☆16Sep 17, 2020Updated 5 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))☆14Mar 22, 2019Updated 7 years ago
- The code has been implemented in Carla Simulator with the help of Double DQN to train an agent how to drive autonomously using different …☆16Aug 20, 2019Updated 6 years ago
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for CIKM 2021 best short paper nomination "Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation" https…☆16Jun 11, 2021Updated 5 years ago
- [译] Java8 中文官方文档(施工中)☆42Sep 17, 2020Updated 5 years ago
- [译] Java 8 简明教程☆11Sep 17, 2020Updated 5 years ago
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa☆11Jan 28, 2020Updated 6 years ago
- Review of Reinforcement Learning☆12Dec 27, 2018Updated 7 years ago
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10☆12Apr 6, 2022Updated 4 years ago
- [译] Gainlo 面试指南☆19Sep 17, 2020Updated 5 years ago
- PythonProgramming.net 系列教程☆11Mar 19, 2022Updated 4 years ago
- [译] Scikit-learn 秘籍☆53Sep 12, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- iBooker 老实人报☆17Apr 20, 2023Updated 3 years ago
- [译] ApacheCN 计算机系统译文集☆23Jul 7, 2022Updated 3 years ago
- ☆12Sep 17, 2020Updated 5 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research☆20Aug 9, 2025Updated 10 months ago
- A Chinese learning note with python codes for Pattern Recognition and Machine Learning.☆31Aug 25, 2018Updated 7 years ago
- [译] kudu 中文文档☆28Mar 19, 2022Updated 4 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Deep Visual MPC-Policy Learning for Navigation☆30May 19, 2022Updated 4 years ago
- 工程数学/数值计算方法与算法C++实现_学习☆15Apr 10, 2017Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ADP☆13Apr 12, 2017Updated 9 years ago
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo.☆10Feb 11, 2022Updated 4 years ago
- Easily manage multiple sessions with telescope integration.☆14Sep 28, 2023Updated 2 years ago
- Multi-robot Reinforcement Learning Scalable Training School (MRST) is a training and evaluation platform for reinforcement learning rease…☆11Sep 6, 2022Updated 3 years ago
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 4 months ago
- 为C++初学者构建的示范项目,数值计算方向。☆10Dec 27, 2019Updated 6 years ago
- A simple camera board using GMAX3412 1" 4K@30fps global shutter sensor☆22Dec 21, 2025Updated 6 months ago
- ☆10Aug 16, 2022Updated 3 years ago
- Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i…☆14Oct 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Jun 9, 2026Updated 3 weeks ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 4 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆97Mar 25, 2021Updated 5 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆11Jan 18, 2025Updated last year
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆15Feb 8, 2024Updated 2 years ago
- A collection of free online materials for control engineering☆21Feb 4, 2025Updated last year