课程笔记,David Silver,CS294 ...
☆15Jan 7, 2019Updated 7 years ago
Alternatives and similar repositories for note-on-Deep-Reinforcement-Learning
Users that are interested in note-on-Deep-Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 斯坦福大学 机器学习 吴恩达 Coursera☆14Feb 8, 2018Updated 8 years ago
- ☆11Aug 14, 2018Updated 7 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- ☆15Feb 5, 2022Updated 4 years ago
- The source code of the paper image enhanced event detection in news articles.☆11May 27, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Aug 30, 2023Updated 2 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 6 years ago
- ☆12Nov 20, 2023Updated 2 years ago
- This is my attempt at recreating the CycleGAN paper: https://arxiv.org/pdf/1703.10593.pdf☆12Apr 13, 2017Updated 9 years ago
- Build an RNN in Keras used for predicting stock prices.☆10May 8, 2018Updated 7 years ago
- Code and analyses related to the ExaLearn drug design efforts☆11Sep 30, 2020Updated 5 years ago
- The aim of this project is to help medical practitioners in a practical way to combat the pandemic.☆11Apr 17, 2021Updated 4 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Notes on "Data Science from Scratch" by Joel Grus☆11Aug 9, 2016Updated 9 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- K-Means is a clustering algorithm which is used for cluster analysis in data mining; it partitions the data set into k clusters. In this …☆11Aug 19, 2017Updated 8 years ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- Udacity (CS101, CS212, CS253, CS262, CS373, CS387) coursework (classwork, quizzes, homework, final exams, projects, contests)☆30Jun 16, 2013Updated 12 years ago
- Removes gaussian noise from colored images using an autoencoder.☆12Dec 2, 2021Updated 4 years ago
- Homepage and materials for the course on data visualization, as part of uc3m’s Master in Computational Social Science☆14Feb 5, 2026Updated 2 months ago
- A gan that gives Portinari's style to real photos☆11Apr 3, 2021Updated 5 years ago
- Implementations of Original Gan, Wgan-div, Lsgan and others with training animations☆10Nov 13, 2019Updated 6 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆11Jul 4, 2022Updated 3 years ago
- Training a car to drive in the CarRacing-v0 Gym Environment using imitation learning.☆21Oct 18, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Tensorflow ResNet implementation on cifar10☆13Aug 10, 2017Updated 8 years ago
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- A simple Flask app to generate answer given an image and a natural language question about the image. The app uses a deep learning model,…☆12Nov 21, 2022Updated 3 years ago
- Training Federated GANs with Theoretical Guarantees: AUniversal Aggregation Approach☆17Jan 18, 2021Updated 5 years ago
- Least Squares Generative Adversarial Network implemented in Chainer☆18Dec 11, 2017Updated 8 years ago
- ☆76Aug 18, 2021Updated 4 years ago
- A Java project for DMCNN. DMCNN is a state-of-the-art method used in event extraction.☆23Oct 8, 2015Updated 10 years ago
- Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!☆10Mar 7, 2018Updated 8 years ago
- A bachelor thesis project about autonomous car maneuver around roundabout using RL-DQN☆14Nov 26, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆24Jul 20, 2017Updated 8 years ago
- 一些周志华西瓜书(公式参照南瓜书)的学习总结和记录☆12May 12, 2019Updated 6 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- clear single-file JAX implementations of common RL algorithms☆16Sep 5, 2021Updated 4 years ago
- scunet登录脚本☆10Sep 7, 2019Updated 6 years ago
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)☆11Nov 20, 2017Updated 8 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆52Apr 13, 2019Updated 7 years ago