Study repo for David Silver's Reinforcement Learning Course
☆12Apr 26, 2019Updated 6 years ago
Alternatives and similar repositories for David-Silver-Reinforcement-Learning-UCL
Users that are interested in David-Silver-Reinforcement-Learning-UCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2023] Code base for the Renyi Kernel Entropy (RKE) metric for generative models.☆13Jun 18, 2025Updated 9 months ago
- In this Repo, native Farsi speakers share their experiences about the international Ph.D. application and study abroad processes.☆19Jul 16, 2024Updated last year
- کدهای مربوط به مقالات آموزشی☆16Mar 19, 2023Updated 3 years ago
- A repository of code examples to accompany the LSU CSC7809/7700/47000 course on AI foundation models.☆13Apr 5, 2025Updated 11 months ago
- Easy21 assignment from David Silver's RL Course at UCL☆12Apr 29, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Apr 18, 2023Updated 2 years ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- My solution to Collaboration and Competition using MADDPG algorithm, Udacity 3rd project of Deep RL Nanodegree from the paper "Multi-Agen…☆10Oct 6, 2019Updated 6 years ago
- A list of publicly available resources regarding the SAS7BDAT file format☆11Jan 10, 2022Updated 4 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Non-orthogonal multiple access (NOMA) for Indoor Visible Light Communications. We offer a complete review of PD-NOMA-based VLC systems in…☆17Oct 18, 2023Updated 2 years ago
- Making agents bet on polymarket☆23Oct 15, 2025Updated 5 months ago
- Traffic Steering (TS) xApp for OAIC O-RAN Testbed☆12Nov 8, 2023Updated 2 years ago
- RLFP (CoRL 2024)☆14Oct 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Sep 13, 2024Updated last year
- ☆17Oct 14, 2021Updated 4 years ago
- Python probabilistic PCA (PPCA) implementation.☆13Nov 28, 2018Updated 7 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- ☆21Jun 16, 2023Updated 2 years ago
- ☆10Jun 13, 2021Updated 4 years ago
- Decentralized deep multi-agent reinforcement learning in physical environments.☆14Aug 19, 2018Updated 7 years ago
- Evidential Calibration☆11Mar 8, 2022Updated 4 years ago
- Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir☆11Oct 6, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Feb 29, 2024Updated 2 years ago
- The homework assignments finished for the coursera specialization "Probabilistic Graphical Models"☆13Jun 16, 2017Updated 8 years ago
- Simple correct&smooth implementation in PyTorch.☆13Nov 8, 2022Updated 3 years ago
- Human Pose and Hip Trajectory Prediction Using Transformers☆16Oct 11, 2023Updated 2 years ago
- ☆17Mar 14, 2026Updated last week
- Repository of the paper "On the Trade-off between Over-smoothing and Over-squashing in Deep Graph Neural Networks" published in ACM CIKM …☆18Aug 8, 2023Updated 2 years ago
- Egocentric Temporal Motifs Miner☆12Nov 9, 2021Updated 4 years ago
- This is the code of reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …☆16Aug 19, 2021Updated 4 years ago
- ETNgen: A temporal graph generator based on Egocentric Temporal Motifs☆14Aug 11, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Replication data and code for "Prestige drives epistemic inequality in the diffusion of scientific ideas"☆14Dec 14, 2018Updated 7 years ago
- Coursera Course --- Probabilistic Graphical Model☆14Jan 5, 2015Updated 11 years ago
- ☆10Dec 6, 2017Updated 8 years ago
- Welcome to the Machine Learning Engineering Repository, a comprehensive collection of resources, code, and insights to guide you through…☆25Feb 25, 2025Updated last year
- Sparse graph attention☆17Sep 20, 2018Updated 7 years ago
- Notes on "Data Science from Scratch" by Joel Grus☆11Aug 9, 2016Updated 9 years ago
- ☆14Dec 13, 2021Updated 4 years ago