UCB CS294-112 深度强化学习中文笔记
☆51Jan 2, 2021Updated 5 years ago
Alternatives and similar repositories for ucb-cs294-112-notes-zh
Users that are interested in ucb-cs294-112-notes-zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [译] 笨办法学 Linux 中文版☆16Dec 24, 2020Updated 5 years ago
- 斯坦福 cs234 强化学习中文讲义☆208Jan 2, 2021Updated 5 years ago
- [译] Python 机器学习在线指南☆17Sep 17, 2020Updated 5 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))☆14Mar 22, 2019Updated 7 years ago
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [译] Java8 中文官方文档(施工中)☆42Sep 17, 2020Updated 5 years ago
- [译] Java 8 简明教程☆11Sep 17, 2020Updated 5 years ago
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa☆11Jan 28, 2020Updated 6 years ago
- Review of Reinforcement Learning☆12Dec 27, 2018Updated 7 years ago
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10☆12Apr 6, 2022Updated 4 years ago
- PythonProgramming.net 系列教程☆11Mar 19, 2022Updated 4 years ago
- [译] Scikit-learn 秘籍☆54Sep 12, 2019Updated 6 years ago
- iBooker 老实人报☆17Apr 20, 2023Updated 3 years ago
- [译] ApacheCN 计算机系统译文集☆23Jul 7, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆22Aug 4, 2022Updated 3 years ago
- ☆54Jul 5, 2021Updated 4 years ago
- 把《Object-Oriented Programming With ANSI-C》翻译成简体中文版本,原书作者:Axel-Tobias Schreiner☆22Mar 15, 2014Updated 12 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Tensorflow tf.metrics tutorial☆12Aug 30, 2018Updated 7 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆10Jan 18, 2025Updated last year
- ☆25Jan 18, 2025Updated last year
- Deep Visual MPC-Policy Learning for Navigation☆30May 19, 2022Updated 3 years ago
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo.☆10Feb 11, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Easily manage multiple sessions with telescope integration.☆14Sep 28, 2023Updated 2 years ago
- Multi-robot Reinforcement Learning Scalable Training School (MRST) is a training and evaluation platform for reinforcement learning rease…☆11Sep 6, 2022Updated 3 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- ☆13Apr 28, 2021Updated 5 years ago
- Converts keras trained models to frozen tensorflow protocol buffers for use with the c++ tensorflow api☆10Sep 28, 2018Updated 7 years ago
- A simple camera board using GMAX3412 1" 4K@30fps global shutter sensor☆21Dec 21, 2025Updated 4 months ago
- ☆10Aug 16, 2022Updated 3 years ago
- ☆17May 31, 2024Updated last year
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- Deep Reinforcement Learning with continuous control in CARLA☆11Dec 8, 2022Updated 3 years ago
- ☆13Aug 23, 2023Updated 2 years ago
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system☆20May 27, 2025Updated 11 months ago
- A collection of free online materials for control engineering☆20Feb 4, 2025Updated last year
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated 2 years ago
- Analizador de tráfico para dispositivos Android potencialmente comprometidos como parte de una botnet orientado a detectar ataques DDoS.☆13Jun 20, 2018Updated 7 years ago