课程笔记,David Silver,CS294 ...
☆15Jan 7, 2019Updated 7 years ago
Alternatives and similar repositories for note-on-Deep-Reinforcement-Learning
Users that are interested in note-on-Deep-Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for URDF parsing code☆13Jun 3, 2026Updated last week
- 斯坦福大学 机器学习 吴恩达 Coursera☆14Feb 8, 2018Updated 8 years ago
- 移动机器人轨迹生成相关代码☆16Jan 23, 2024Updated 2 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- 自动驾驶系统demo☆16Jun 23, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆15Apr 17, 2022Updated 4 years ago
- ☆15Feb 5, 2022Updated 4 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 6 years ago
- pytorch implementation for "Mutual Information Neural Estimation"☆11Dec 13, 2019Updated 6 years ago
- 主观题☆11Dec 19, 2018Updated 7 years ago
- This is my attempt at recreating the CycleGAN paper: https://arxiv.org/pdf/1703.10593.pdf☆12Apr 13, 2017Updated 9 years ago
- A ros package to control unitree a1 along side with unitree_ros package utilizing a pytorch model trained in isaac-gym☆13Jul 16, 2023Updated 2 years ago
- Code and analyses related to the ExaLearn drug design efforts☆11Sep 30, 2020Updated 5 years ago
- ☆13Jan 13, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- 这是2023FSAC使用的VCU程序,保函整车驱动、逻辑交互以及控制策略。无人车第一代下层控制策略尚不成熟,欢迎广大FSAC/FSEC以及汽车界各位前辈、同学以及学弟学妹们指正、补充。同时本代码开源在HRT电无人自主搭建的开源网站上,随后将发布。开源网站将发布所有上层感知与…☆28Jan 7, 2026Updated 5 months ago
- 图神经网络课程——图注意力网络☆11Dec 28, 2019Updated 6 years ago
- Homepage and materials for the course on data visualization, as part of uc3m’s Master in Computational Social Science☆14Feb 5, 2026Updated 4 months ago
- Training a car to drive in the CarRacing-v0 Gym Environment using imitation learning.☆21Oct 18, 2020Updated 5 years ago
- Tensorflow ResNet implementation on cifar10☆13Aug 10, 2017Updated 8 years ago
- This repository contains implementation of reinforcement learning based driving agent in Carla aswell as Localization and Mapping in C++☆13Oct 19, 2023Updated 2 years ago
- nonlinear solver for the constrained problem☆21Jun 7, 2026Updated last week
- Training Federated GANs with Theoretical Guarantees: AUniversal Aggregation Approach☆17Jan 18, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆75Aug 18, 2021Updated 4 years ago
- Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!☆10Mar 7, 2018Updated 8 years ago
- OpenAI Gym 学习笔记☆14Aug 25, 2018Updated 7 years ago
- 一些周志华西瓜书(公式参照南瓜书)的学习总结和记录☆12May 12, 2019Updated 7 years ago
- ☆25Oct 22, 2015Updated 10 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Oct 25, 2018Updated 7 years ago
- clear single-file JAX implementations of common RL algorithms☆15Sep 5, 2021Updated 4 years ago
- We're generating faces with Pytorch GAN implementations☆15Dec 18, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ♊ Stanford CS230 : Deep Learning☆16Jan 14, 2019Updated 7 years ago
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)☆11Nov 20, 2017Updated 8 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆52Apr 13, 2019Updated 7 years ago
- Train a bidirectional or normal LSTM recurrent neural network to generate text on a free GPU using any dataset. Just upload your text fil…☆12Jan 29, 2019Updated 7 years ago
- rust-libp2p examples☆11Dec 28, 2022Updated 3 years ago
- The ts302_team final solution to the KDD CUP 2019 AutoML Track problem.☆15Jul 3, 2020Updated 5 years ago
- Exercises and R code related to the book Applied Predictive Modeling by Max Kuhn and Kjell Johnson☆12Feb 3, 2016Updated 10 years ago