UCB CS294-112 Deep Reinforcement Learning: Chinese Notes
☆51 · Jan 2, 2021 · Updated 5 years ago
Alternatives and similar repositories for ucb-cs294-112-notes-zh
Users that are interested in ucb-cs294-112-notes-zh are comparing it to the libraries listed below.
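- [Translation] Learn Linux the Hard Way, Chinese edition ☆16 · Dec 24, 2020 · Updated 5 years ago
- Stanford CS234 Reinforcement Learning lecture notes in Chinese ☆211 · Jan 2, 2021 · Updated 5 years ago
- [Translation] Online Guide to Machine Learning in Python ☆16 · Sep 17, 2020 · Updated 5 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN)) ☆14 · Mar 22, 2019 · Updated 7 years ago
- Notes and code from studying reinforcement learning ☆12 · Jul 27, 2020 · Updated 5 years ago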
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa ☆11 · Jan 28, 2020 · Updated 6 years ago
- PyTorch implementation of Distributed SAC. Test environments are LunarLanderContinuous-v2 and Metaworld MT1, MT10 ☆12 · Apr 6, 2022 · Updated 4 years ago
- [Translation] Scikit-learn Cookbook ☆53 · Sep 12, 2019 · Updated 6 years ago
- ☆10 · Feb 13, 2022 · Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation also includes the extensions Munchausen RL and D2R… ☆24 · Apr 7, 2021 · Updated 5 years ago
- Deploy a machine learning model with TensorFlow Serving and Docker ☆10 · Dec 5, 2018 · Updated 7 years ago
- [Translation] The Hundred-Page Machine Learning Book ☆140 · Sep 17, 2020 · Updated 5 years ago
- ☆54 · Jul 5, 2021 · Updated 4 years ago
- Chinese study notes with Python code for Pattern Recognition and Machine Learning. ☆30 · Aug 25, 2018 · Updated 7 years ago
- Risk-sensitive Inverse Reinforcement Learning ☆11 · Sep 11, 2019 · Updated 6 years ago
- TensorFlow tf.metrics tutorial ☆12 · Aug 30, 2018 · Updated 7 years ago
- ☆25 · Jan 18, 2025 · Updated last year
- ADP ☆13 · Apr 12, 2017 · Updated 9 years ago
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo. ☆10 · Feb 11, 2022 · Updated 4 years ago
- Easily manage multiple sessions with telescope integration. ☆14 · Sep 28, 2023 · Updated 2 years ago
- Made for a reading group at the Center for Safe AGI. ☆12 · Feb 23, 2026 · Updated 3 months ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels, i.e. Intersection level and Infrastructure to Vehicl… ☆10 · Oct 12, 2020 · Updated 5 years ago
- Converts Keras-trained models to frozen TensorFlow protocol buffers for use with the C++ TensorFlow API ☆10 · Sep 28, 2018 · Updated 7 years ago
- [Translation] UIUC CS241 System Programming lecture notes in Chinese ☆49 · Aug 17, 2022 · Updated 3 years ago
- Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i… ☆13 · Oct 12, 2024 · Updated last year
- ☆17 · Updated this week
- Reinforcement Learning from Hierarchical Critics ☆14 · Jul 30, 2020 · Updated 5 years ago
- Code for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning" ☆11 · Jun 24, 2022 · Updated 3 years ago
- Implementations of all kinds of DQN reinforcement learning with PyTorch ☆97 · Mar 25, 2021 · Updated 5 years ago
- Deep Reinforcement Learning with continuous control in CARLA ☆11 · Dec 8, 2022 · Updated 3 years ago
- TensorFlow tutorial in Chinese ☆11 · Oct 21, 2016 · Updated 9 years ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations" ☆15 · Feb 8, 2024 · Updated 2 years ago
- Distributed Multi-Object Tracking Under Limited Field of View Sensors. ☆21 · Oct 8, 2021 · Updated 4 years ago
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system ☆21 · May 27, 2025 · Updated last year
- ☆13 · Aug 23, 2023 · Updated 2 years ago
- Official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆13 · Apr 15, 2024 · Updated 2 years ago
- A collection of free online materials for control engineering ☆21 · Feb 4, 2025 · Updated last year
- Traffic analyzer for Android devices potentially compromised as part of a botnet, aimed at detecting DDoS attacks. ☆13 · Jun 20, 2018 · Updated 7 years ago
- ☆12 · Jan 3, 2022 · Updated 4 years ago