cryer/D.Silver_RL_Course

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cryer/D.Silver_RL_Course)

cryer / D.Silver_RL_Course

Some notes and experience about David Silver's Reinforcement Learning Course

☆47

Alternatives and similar repositories for D.Silver_RL_Course

Users that are interested in D.Silver_RL_Course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FanmingL / SmartLogger
View on GitHub
☆12May 14, 2024Updated 2 years ago
AIDefender / MyDiscor
View on GitHub
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14May 24, 2021Updated 5 years ago
yuanyehome / PALT
View on GitHub
This is the source code of our paper PALT in EMNLP2022.
☆12Nov 19, 2022Updated 3 years ago
kvfrans / Easy21-RL
View on GitHub
solutions to David Silver's RL course project Easy21
☆19Jun 28, 2016Updated 10 years ago
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12Oct 8, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
alessiabertugli / AC-VRNN
View on GitHub
PyTorch code for CVIU paper "AC-VRNN: Attentive Conditional-VRNN for Multi-Future Trajectory Prediction"
☆26Jul 8, 2021Updated 5 years ago
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated 2 years ago
osudrl / RSS-2020-learning-memory-based-control
View on GitHub
Code for recreating the results of our RSS 2020 paper, 'Learning Memory-Based Control for Human-Scale Bipedal Locomotion.'
☆10Aug 18, 2022Updated 3 years ago
LuEE-C / Noisy-A3C-Keras
View on GitHub
☆10May 29, 2018Updated 8 years ago
zuoxingdong / gym-recsys
View on GitHub
Customizable RecSys Simulator for OpenAI Gym
☆26Dec 7, 2021Updated 4 years ago
Zhiyu-Lei / Traffic-Sign-Detection-and-Information-Extraction
View on GitHub
Train YOLO object detection model to find traffic signs in the images. Use OCR pipeline to extract the information from the signs with te…
☆13Dec 26, 2020Updated 5 years ago
allenai / allennlp-reading-comprehension-research
View on GitHub
☆41Feb 12, 2019Updated 7 years ago
GAIR-NLP / MetaCritique
View on GitHub
Evaluate the Quality of Critique
☆37Jun 1, 2024Updated 2 years ago
sheep333c / DIVE
View on GitHub
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
☆26Mar 13, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
angelobanse / sumoScenarioGenerator
View on GitHub
SUMO Scenario Generator is a web application that generates and downloads the necessary files to start a basic road traffic simulation in…
☆12Jun 25, 2020Updated 6 years ago
jsbaan / DPAC-DialogueGAN
View on GitHub
This repo implements GAN-based models for Dialogue Generation (DP-GAN, SeqGAN, and our own proposed DPAC-GAN)
☆29Mar 24, 2024Updated 2 years ago
gunchagarg / differential-learning-rate-keras
View on GitHub
Implementation of Differential Learning Rate in Keras
☆11Jun 4, 2019Updated 7 years ago
buschman-lab / RotationalDynamics
View on GitHub
Code for creating recurrent neural network with rotational dynamics. Model is discussed in detail in "Rotational Dynamics Reduce Interfer…
☆17Jul 23, 2020Updated 6 years ago
Hyeokreal / Actor-Critic-Continuous-Keras
View on GitHub
Keras Implementation of the continuous control with actor-critic, a3c
☆13Dec 3, 2017Updated 8 years ago
MIRALab-USTC / L2O-G2MILP
View on GitHub
This is the code for G2MILP, a deep learning-based mixed-integer linear programming (MILP) instance generator.
☆37Oct 3, 2024Updated last year
IBiDat / dataviz
View on GitHub
Homepage and materials for the course on data visualization, as part of uc3m’s Master in Computational Social Science
☆14Feb 5, 2026Updated 5 months ago
L706077 / Deep-Reinforcement-Learning-Papers
View on GitHub
awesome deep learning papers for reinforcement learning
☆17Jan 10, 2018Updated 8 years ago
Continual-Lifelong-Learning / resources
View on GitHub
☆17Feb 21, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
liuhuanshuo / notes-python
View on GitHub
中文 Python 笔记
☆12Jan 15, 2018Updated 8 years ago
lancopku / SACT
View on GitHub
Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)
☆14Apr 16, 2019Updated 7 years ago
gunnarfloetteroed / java
View on GitHub
☆13May 20, 2022Updated 4 years ago
WillenPeng / Path_Planning_Algorithm
View on GitHub
☆12Dec 10, 2018Updated 7 years ago
moinnadeem / CDSSM
View on GitHub
Convolutional Deep Semantic Similarity Model
☆20Feb 15, 2023Updated 3 years ago
empriselab / feeding-deployment
View on GitHub
Code for the robot-assisted feeding project at EmPRISE Lab
☆30Updated this week
mepa / datasci-from-scratch-notes
View on GitHub
Notes on "Data Science from Scratch" by Joel Grus
☆11Aug 9, 2016Updated 9 years ago
GAIR-NLP / scaleeval
View on GitHub
Scalable Meta-Evaluation of LLMs as Evaluators
☆43Feb 15, 2024Updated 2 years ago
christianfosli / wsl-copy
View on GitHub
Vim plugin to copy text to Windows clipboard on WSL
☆12Jan 8, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
MLWhiz / chatbot
View on GitHub
This is a repository for the post : https://mlwhiz.com/blog/2019/04/15/chatbot/
☆11Apr 15, 2019Updated 7 years ago
kkelchte / task_free_continual_learning
View on GitHub
This repository demonstrates the application of our proposed task-free continual learning method on a synthetic experiment.
☆13Jun 24, 2019Updated 7 years ago
percevalw / rich-logger
View on GitHub
Table logger using Rich
☆13Aug 13, 2025Updated 11 months ago
rohanchandra30 / GraphRQI
View on GitHub
This is the codebase for our ICRA 2020 submission, GraphRQI: Classifying Driver Behaviors Using Graph Spectrums.
☆13Dec 8, 2019Updated 6 years ago
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
berkay-onder / ELMoForManyLangs
View on GitHub
☆12Aug 25, 2019Updated 6 years ago
facebookresearch / SALSA
View on GitHub
Source code for the paper SALSA Attacking Lattice Cryptography with Transformers (Wenger et al., Neurips 2022)
☆30Nov 22, 2022Updated 3 years ago