xia0nan/David-Silver-Reinforcement-Learning-UCL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xia0nan/David-Silver-Reinforcement-Learning-UCL)

xia0nan / David-Silver-Reinforcement-Learning-UCL

Study repo for David Silver's Reinforcement Learning Course

☆12

Alternatives and similar repositories for David-Silver-Reinforcement-Learning-UCL

Users that are interested in David-Silver-Reinforcement-Learning-UCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mjalali / renyi-kernel-entropy
View on GitHub
[NeurIPS 2023] Code base for the Renyi Kernel Entropy (RKE) metric for generative models.
☆14Jun 18, 2025Updated last year
Fatemeh-J / Putting-in-for-a-PhD-and-moving-abroad
View on GitHub
In this Repo, native Farsi speakers share their experiences about the international Ph.D. application and study abroad processes.
☆20Jul 16, 2024Updated 2 years ago
jghawaly / CSC7809_FoundationModels
View on GitHub
A repository of code examples to accompany the LSU CSC7809/7700/47000 course on AI foundation models.
☆13Apr 5, 2025Updated last year
mohamad-dehghani / tutorial
View on GitHub
کدهای مربوط به مقالات آموزشی
☆16Mar 19, 2023Updated 3 years ago
timbmg / easy21-rl
View on GitHub
Easy21 assignment from David Silver's RL Course at UCL
☆11Apr 29, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
moukamisama / Recon
View on GitHub
☆12Apr 18, 2023Updated 3 years ago
yarikama / Agentic-Advanced-RAG
View on GitHub
Building a multi-agent RAG system with advanced RAG methods
☆13Jan 12, 2025Updated last year
natanzi / ts-xapp
View on GitHub
Traffic Steering (TS) xApp for OAIC O-RAN Testbed
☆12Nov 8, 2023Updated 2 years ago
EyaRhouma / collaboration-competition-MADDPG
View on GitHub
My solution to Collaboration and Competition using MADDPG algorithm, Udacity 3rd project of Deep RL Nanodegree from the paper "Multi-Agen…
☆10Oct 6, 2019Updated 6 years ago
xiaodaigh / sas7bdat-resources
View on GitHub
A list of publicly available resources regarding the SAS7BDAT file format
☆11Jan 10, 2022Updated 4 years ago
osigaud / Basic-Policy-Gradient-Labs
View on GitHub
A repo to design basic Policy Gradient labs
☆12Jul 6, 2023Updated 3 years ago
feshchenkod / rpc-nodes
View on GitHub
☆21Jun 16, 2023Updated 3 years ago
YeWR / RLFP
View on GitHub
RLFP (CoRL 2024)
☆14Oct 11, 2024Updated last year
AUT-NLP / PQuAD
View on GitHub
☆13Mar 2, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
davidstutz / probabilistic-pca
View on GitHub
Python probabilistic PCA (PPCA) implementation.
☆13Nov 28, 2018Updated 7 years ago
SagiLevanon1 / scmp
View on GitHub
☆10Jun 13, 2021Updated 5 years ago
nicoring / rec-maddpg
View on GitHub
Decentralized deep multi-agent reinforcement learning in physical environments.
☆14Aug 19, 2018Updated 7 years ago
ituvisionlab / EvidentialTuringProcess
View on GitHub
Evidential Calibration
☆11Mar 8, 2022Updated 4 years ago
kargaranamir / Persian-Datasets
View on GitHub
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
☆14Oct 6, 2023Updated 2 years ago
jasonlovescoding / Coursera-ProbabilisticGraphicalModels
View on GitHub
The homework assignments finished for the coursera specialization "Probabilistic Graphical Models"
☆13Jun 16, 2017Updated 9 years ago
andrewk1 / correctandsmooth
View on GitHub
Simple correct&smooth implementation in PyTorch.
☆13Nov 8, 2022Updated 3 years ago
mmahdavian / STPOTR
View on GitHub
Human Pose and Hip Trajectory Prediction Using Transformers
☆16Oct 11, 2023Updated 2 years ago
tushar-ydv / VLC_NOMA
View on GitHub
Non-orthogonal multiple access (NOMA) for Indoor Visible Light Communications. We offer a complete review of PD-NOMA-based VLC systems in…
☆17Oct 18, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
automl / HPO_for_RL
View on GitHub
This is the code of reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …
☆16Aug 19, 2021Updated 4 years ago
AntonioLonga / Egocentric-Temporal-Motifs-Miner-ETMM
View on GitHub
Egocentric Temporal Motifs Miner
☆13Nov 9, 2021Updated 4 years ago
YunshengWei / PGM
View on GitHub
Coursera Course --- Probabilistic Graphical Model
☆15Jan 5, 2015Updated 11 years ago
AntonioLonga / ETNgen
View on GitHub
ETNgen: A temporal graph generator based on Egocentric Temporal Motifs
☆15Aug 11, 2023Updated 2 years ago
ozkary / machine-learning-engineering
View on GitHub
Welcome to the Machine Learning Engineering Repository, a comprehensive collection of resources, code, and insights to guide you through…
☆25Feb 25, 2025Updated last year
Barbany / Multi-speaker-Neural-Vocoder
View on GitHub
Bachelor's thesis carried at Universitat Politecnica de Catalunya in partial fullfilment of the requirements for the degree in Telecommun…
☆16Jul 25, 2024Updated 2 years ago
allisonmorgan / epistemic_inequality
View on GitHub
Replication data and code for "Prestige drives epistemic inequality in the diffusion of scientific ideas"
☆14Dec 14, 2018Updated 7 years ago
Ericjeff / kafka-sparkStreaming-redis
View on GitHub
☆10Dec 6, 2017Updated 8 years ago
santos-j / xapp_development_zero_to_hero
View on GitHub
☆18Sep 13, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ds4dm / sparse-gcn
View on GitHub
Sparse graph attention
☆17Sep 20, 2018Updated 7 years ago
chao1224 / Loss-Balanced-Task-Weighting
View on GitHub
Loss-Balanced Task Weighting to Reduce Negative Transfer in Multi-Task Learning, AAAI-SA'19
☆30Sep 23, 2019Updated 6 years ago
gunchagarg / differential-learning-rate-keras
View on GitHub
Implementation of Differential Learning Rate in Keras
☆11Jun 4, 2019Updated 7 years ago
ispamm / FairDrop
View on GitHub
☆14Updated this week
ThibautTheate / Risk-Sensitive-Policy-with-Distributional-Reinforcement-Learning
View on GitHub
Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…
☆16Dec 19, 2022Updated 3 years ago
haixiangyan / leetcode-python
View on GitHub
Python solutions to coding questions in Leetcode
☆13Sep 12, 2020Updated 5 years ago
yushundong / Fairness-must-read-list
View on GitHub
Papers on fairness
☆12Oct 20, 2020Updated 5 years ago