I have created a small book summarizing concepts from the Reinforcement Learning part of the ATML 2015 course at UCL (https://www.davidsilver.uk/teaching/)
☆44Dec 30, 2021Updated 4 years ago
Alternatives and similar repositories for Reinforcement-Learning-Book
Users that are interested in Reinforcement-Learning-Book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- hanabi_learning_environment is a research platform for Hanabi experiments.☆11May 24, 2026Updated last month
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- HiDeF (Hierarchical community Decoding Framework)☆15Jan 30, 2024Updated 2 years ago
- Using WoLF (win or learn fast) PHC (policy hill climbing) algorithm to implement stochastic games☆15Jun 14, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The JSON file for the ICD-9-CM and ICD-10-CM hierarchy, including diagnosis codes and procedure codes☆14Jan 26, 2023Updated 3 years ago
- Float NDArray library for Swift, accelerated with Accelerate Framework☆13Apr 17, 2019Updated 7 years ago
- vImage / Accelerate convolution filter in Swift☆14May 20, 2015Updated 11 years ago
- A project for trying out the various CoreImage filters supported by iOS☆18Jan 18, 2017Updated 9 years ago
- Causal Impact of an intervention integrated with control group selection☆10Sep 11, 2022Updated 3 years ago
- Manubot for white paper☆13Sep 29, 2020Updated 5 years ago
- Calculation of power indices (e.g. Banzhaf power index, Shapley-Shubik power index etc)☆18Nov 26, 2025Updated 7 months ago
- A (user-)friendly wrapper to nvdia-smi☆28Mar 17, 2024Updated 2 years ago
- ☆15Dec 18, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fast and procedurally generated side-scroller-game-like graphical environments (formerly Procgen)☆35Jul 7, 2023Updated 2 years ago
- Using Database Rule for Weak Supervised Text-to-SQL Generation https://arxiv.org/abs/1907.00620☆12May 11, 2021Updated 5 years ago
- ☆13Aug 16, 2020Updated 5 years ago
- Advanced Deep Learning and Reinforcement Learning 2018 Assignments☆18Nov 24, 2018Updated 7 years ago
- A Python FASTA file Parser and Writer.☆17Sep 3, 2022Updated 3 years ago
- ☆16Nov 7, 2020Updated 5 years ago
- This is an unofficial LaTeX Beamer poster template for Stanford University.☆12Dec 27, 2018Updated 7 years ago
- GupShup: Summarizing Open-Domain Code-Switched Conversations EMNLP 2021☆15Nov 14, 2021Updated 4 years ago
- ☆29Feb 23, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- iOS Augmented Reality sample code for blog post☆16May 2, 2017Updated 9 years ago
- SmoothE: Differentiable E-Graph Extraction (ASPLOS'25 Best Paper)☆33Jan 15, 2026Updated 5 months ago
- 💡 A Philips Hue library written in Swift, using Combine framework☆16Feb 16, 2020Updated 6 years ago
- A complete Tensorflow implementation of cutout random erasing (without numpy)☆13Jan 8, 2019Updated 7 years ago
- Lux AI environment interface for RLlib multi-agents☆12Sep 23, 2021Updated 4 years ago
- This repository contains the code for implementing Bidirectional Relevance scores for Digital Histopathology, which was used for the resu…☆16Mar 24, 2023Updated 3 years ago
- [RSS 2026] The first framework enabling humanoid robots to learn whole-body loco-manipulation from egocentric human demos☆179Jun 6, 2026Updated 3 weeks ago
- SenseSystem is a flexible, extensible, and multi-functional system for detecting and responding to the detection of game objects in Unrea…☆22May 27, 2024Updated 2 years ago
- Cytoscape 3 implementation bundles.☆43May 27, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SPACE: STRING proteins as complementary embeddings☆40Apr 10, 2026Updated 2 months ago
- Unofficial implementation of Stand-Alone Self-Attention in Vision Models (obsolete)☆44Jul 1, 2019Updated 6 years ago
- Build neural networks with less boilerplate code☆165Aug 23, 2023Updated 2 years ago
- Optimized Differentiable Neural Computer In Chainer☆23Jul 12, 2018Updated 7 years ago
- Demo of UIImageOrientation to TIFF Orientation conversion that fixes orientation issues when creating CIImage from UIImage☆25Sep 4, 2016Updated 9 years ago
- k-nearest neighbors and dynamic time warping written in Swift☆20Aug 31, 2016Updated 9 years ago
- Automated Headline generation and Aspect Based Sentiment Analysis☆15Feb 16, 2023Updated 3 years ago