Chinese notes for UCB CS294-112 Deep Reinforcement Learning
☆51 · Jan 2, 2021 · Updated 5 years ago
Alternatives and similar repositories for ucb-cs294-112-notes-zh
Users that are interested in ucb-cs294-112-notes-zh are comparing it to the libraries listed below.
- [Translation] Learn Linux the Hard Way, Chinese edition ☆16 · Dec 24, 2020 · Updated 5 years ago
- Stanford CS234 Reinforcement Learning lecture notes in Chinese ☆209 · Jan 2, 2021 · Updated 5 years ago
- [Translation] Online Guide to Machine Learning in Python ☆17 · Sep 17, 2020 · Updated 5 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN)) ☆14 · Mar 22, 2019 · Updated 7 years ago
- The code has been implemented in Carla Simulator with the help of Double DQN to train an agent how to drive autonomously using different … ☆16 · Aug 20, 2019 · Updated 6 years ago
- Review of Reinforcement Learning ☆12 · Dec 27, 2018 · Updated 7 years ago
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10 ☆12 · Apr 6, 2022 · Updated 4 years ago
- PythonProgramming.net tutorial series ☆11 · Mar 19, 2022 · Updated 4 years ago
- iBooker 老实人报 ("Honest Folks' Paper") ☆17 · Apr 20, 2023 · Updated 3 years ago
- [Translation] ApacheCN collection of translated computer systems texts ☆23 · Jul 7, 2022 · Updated 3 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆22 · Aug 4, 2022 · Updated 3 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research ☆20 · Aug 9, 2025 · Updated 9 months ago
- ☆10 · Feb 13, 2022 · Updated 4 years ago
- [Translation] The Hundred-Page Machine Learning Book ☆141 · Sep 17, 2020 · Updated 5 years ago
- ☆54 · Jul 5, 2021 · Updated 4 years ago
- Risk-sensitive Inverse Reinforcement Learning ☆11 · Sep 11, 2019 · Updated 6 years ago
- ☆25 · Jan 18, 2025 · Updated last year
- A stepwise strategy for multi-UAV path planning based on the directed A* algorithm ☆11 · Aug 26, 2018 · Updated 7 years ago
- ADP ☆13 · Apr 12, 2017 · Updated 9 years ago
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo. ☆10 · Feb 11, 2022 · Updated 4 years ago
- ☆13 · Feb 1, 2025 · Updated last year
- Multi-robot Reinforcement Learning Scalable Training School (MRST) is a training and evaluation platform for reinforcement learning rease… ☆11 · Sep 6, 2022 · Updated 3 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl… ☆10 · Oct 12, 2020 · Updated 5 years ago
- ☆13 · Apr 28, 2021 · Updated 5 years ago
- A demo project for C++ beginners, focused on numerical computing. ☆10 · Dec 27, 2019 · Updated 6 years ago
- A simple camera board using GMAX3412 1" 4K@30fps global shutter sensor ☆21 · Dec 21, 2025 · Updated 5 months ago
- Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i… ☆13 · Oct 12, 2024 · Updated last year
- Reinforcement Learning from Hierarchical Critics ☆14 · Jul 30, 2020 · Updated 5 years ago
- Deep Reinforcement Learning with continuous control in CARLA ☆11 · Dec 8, 2022 · Updated 3 years ago
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system ☆20 · May 27, 2025 · Updated 11 months ago
- ☆13 · Aug 23, 2023 · Updated 2 years ago
- Official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆13 · Apr 15, 2024 · Updated 2 years ago
- A collection of free online materials for control engineering ☆20 · Feb 4, 2025 · Updated last year
- Traffic analyzer for Android devices potentially compromised as part of a botnet, aimed at detecting DDoS attacks. ☆13 · Jun 20, 2018 · Updated 7 years ago
- ☆12 · Jan 3, 2022 · Updated 4 years ago
- Linux client release for the Lark suite (Feishu). Unofficial. ☆10 · Jul 3, 2021 · Updated 4 years ago
- Multi Type Mean Field Reinforcement Learning ☆31 · Jun 13, 2022 · Updated 3 years ago
- ☆10 · Dec 9, 2021 · Updated 4 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning> ☆11 · Oct 8, 2021 · Updated 4 years ago