I have created a small book summarizing concepts from the Reinforcement Learning part of the ATML 2015 course at UCL (https://www.davidsilver.uk/teaching/)
☆43Dec 30, 2021Updated 4 years ago
Alternatives and similar repositories for Reinforcement-Learning-Book
Users that are interested in Reinforcement-Learning-Book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- HiDeF (Hierarchical community Decoding Framework)☆15Jan 30, 2024Updated 2 years ago
- RxCoreNFC (based on RxSwift)☆10Apr 15, 2021Updated 5 years ago
- The JSON file for the ICD-9-CM and ICD-10-CM hierarchy, including diagnosis codes and procedure codes☆13Jan 26, 2023Updated 3 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆11Nov 16, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Causal Impact of an intervention integrated with control group selection☆10Sep 11, 2022Updated 3 years ago
- RxVision (based on RxSwift)☆13Jul 1, 2020Updated 5 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Sep 24, 2025Updated 7 months ago
- A multi-hop Q/A architecture based on transformers and GCNs.☆16Aug 12, 2019Updated 6 years ago
- ☆15Dec 18, 2017Updated 8 years ago
- ☆12Apr 19, 2019Updated 7 years ago
- The source of the new Skin UI SDKs for both Android and IOS☆13Jul 8, 2023Updated 2 years ago
- A network based gene classification library to generate genome wide predictions about genes that are functionally similar to the input ge…☆20Apr 22, 2026Updated 3 weeks ago
- Using Database Rule for Weak Supervised Text-to-SQL Generation https://arxiv.org/abs/1907.00620☆12May 11, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Aug 16, 2020Updated 5 years ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆18Dec 8, 2022Updated 3 years ago
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆26Apr 17, 2026Updated last month
- A pure Python and Numpy implementation of an LSTM Network☆14Mar 2, 2017Updated 9 years ago
- My notes during the 2017 IIIT Hyderabad Summer Schools on Computer Vision and Machine Learning☆24Sep 28, 2017Updated 8 years ago
- ☆94Mar 16, 2026Updated 2 months ago
- RxAVFoundation (based on RxSwift)☆17Jun 16, 2022Updated 3 years ago
- Communication using GNN in MARL☆35Jan 3, 2022Updated 4 years ago
- ☆16Nov 7, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- **IUST_PersonReID** is a culturally distinctive dataset for person re-identification, focused on Islamic countries, designed to help redu…☆32Jun 29, 2025Updated 10 months ago
- Dynamic Kd-Tree: Euclidean, SO(2), SO(3), SE(3) and more!☆37May 11, 2026Updated last week
- iOS Augmented Reality sample code for blog post☆16May 2, 2017Updated 9 years ago
- 💡 A Philips Hue library written in Swift, using Combine framework☆16Feb 16, 2020Updated 6 years ago
- SPACE: STRING proteins as complementary embeddings☆39Apr 10, 2026Updated last month
- Track device orientation changes even for devices with orientation-lock turned on.☆19Oct 16, 2020Updated 5 years ago
- A SQL Query Similarity Metric Benchmark☆16Apr 22, 2018Updated 8 years ago
- Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein Localization Prediction☆29Oct 1, 2023Updated 2 years ago
- ☆15Apr 25, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Build neural networks with less boilerplate code☆165Aug 23, 2023Updated 2 years ago
- k-nearest neighbors and dynamic time warping written in Swift☆20Aug 31, 2016Updated 9 years ago
- A simple demonstration of the CMAltimeter class and barometer sensor on iPhone 6 / 6 Plus and newer.☆25Sep 21, 2016Updated 9 years ago
- ☆15Aug 21, 2022Updated 3 years ago
- A Julia Package for providing Multi Armed Bandit Experiments☆21Jul 19, 2018Updated 7 years ago
- New York Times articles as a knowledge graph☆29Nov 7, 2023Updated 2 years ago
- The unified framework for sim & real robot teleoperation☆186Updated this week