Kotlin implementation of algorithms, examples, and exercises from Sutton and Barto's Reinforcement Learning: An Introduction (2nd Edition)
☆40Apr 18, 2021Updated 4 years ago
Alternatives and similar repositories for Reinforcement-Learning-An-Introduction
Users that are interested in Reinforcement-Learning-An-Introduction are comparing it to the libraries listed below.
- Advantage Alignment Algorithms (ICLR 2025 oral)☆18Apr 7, 2025Updated last year
- [AAAI 2018] Implementation of the Ethics Shaping approach proposed in "A low-cost ethics shaping approach for designing reinforcement lea…☆11Aug 3, 2018Updated 7 years ago
- An idiomatic Kotlin dataframe toolkit for data engineering tasks on datasets of any size☆10Jul 16, 2025Updated 8 months ago
- Simple PHP API to CouchDB☆27Aug 13, 2011Updated 14 years ago
- A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.☆11May 29, 2023Updated 2 years ago
- This is a proof of concept exploit which bypasses root detection in Samsung's Knox Messenger. This has been reported to Knox Messenger te…☆15Jan 30, 2020Updated 6 years ago
- A dynamic version of std::bitset☆17Aug 25, 2013Updated 12 years ago
- A reporting project on the performance of self-optimizing interpreters☆16Dec 5, 2015Updated 10 years ago
- Curated list of machine learning and deep learning frameworks and resources for JVM☆20Dec 9, 2020Updated 5 years ago
- A multi-agent reinforcement learning framework for optimizing coverage and connectivity in Space-Air-Ground integrated networks. This pro…☆58Feb 26, 2026Updated last month
- DQN, DDQN, and Policy Gradient Algorithm-Based Antenna Selection Schemes in MIMO Systems.☆11Dec 17, 2023Updated 2 years ago
- Image Classification Demo for Keras at TensorFlow Summit Extended KL 2017☆11Jul 1, 2020Updated 5 years ago
- This repository contains the simulation source code for implementing reinforcement learning algorithms for autonomous navigation of ardon…☆11Mar 20, 2021Updated 5 years ago
- Deep Reinforcement Learning (RL) Using Python☆35Apr 20, 2024Updated last year
- paper code☆11Jul 25, 2022Updated 3 years ago
- C++17 exploration of a classic MUD like game☆15Jun 6, 2021Updated 4 years ago
- Deep Q network-based power allocation for multi-cell massive MIMO cellular network.☆21Dec 17, 2023Updated 2 years ago
- ☆14Jun 20, 2023Updated 2 years ago
- Policy learning of in-hand manipulation. Proximal policy optimization trains the Allegro hand to learn a stabilizing grasp☆14Feb 5, 2024Updated 2 years ago
- ☆12Mar 24, 2023Updated 3 years ago
- Replaced by http://beaucatcher.org/, ignore this repo☆15Jul 4, 2011Updated 14 years ago
- Miniprojects for the MICRO-507 : Legged Robots course☆12Jul 1, 2022Updated 3 years ago
- ☆11Mar 31, 2020Updated 6 years ago
- Londogard Natural Language Processing Toolkit written in Kotlin☆74Apr 13, 2023Updated 2 years ago
- Promises and Java8 / RXJava like streaming for Xtend☆15Feb 16, 2018Updated 8 years ago
- Deep Reinforcement Learning and BCD to solve phase shift and resource allocation of RIS and RSU☆32Jan 18, 2021Updated 5 years ago
- Neural Network with single hidden layer learning MNIST with less than 1.2% test error.☆22May 4, 2013Updated 12 years ago
- PyTorch implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 9 months ago
- The com.ostermiller.util package is open source (GPL) Java utilities maintained by Stephen Ostermiller.☆13Jan 27, 2023Updated 3 years ago
- A parallelized implementation of optimized MSD radix sort for strings☆27Oct 11, 2018Updated 7 years ago
- Multi-Agent Determinantal Q-Learning☆43Nov 22, 2022Updated 3 years ago
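- A Python 3 model for recognizing text in images of business registration information. The main steps include denoising, watermark separation, color inversion, and binarization enhancement; batch image processing is supported.☆13Mar 14, 2020Updated 6 years ago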
- Simple and sane Java path watching API on top of WatchService☆15Jun 9, 2016Updated 9 years ago
- Contact-Aware Symplectic Integrator Network☆16Mar 22, 2023Updated 3 years ago
- Multiagent gridworld for the TEAM project based on gym-minigrid☆12Nov 27, 2019Updated 6 years ago
- ☆19Mar 18, 2024Updated 2 years ago
- Matlab codes for paper 'K. -H. Ngo, N. T. Nguyen, T. Q. Dinh, T. -M. Hoang and M. Juntti, "Low-Latency and Secure Computation Offloading …☆30Feb 13, 2022Updated 4 years ago
- analysis of public NLP corpora☆11Feb 9, 2023Updated 3 years ago
- This project contains several Deep Reinforcement Learning methods and some experiments based on OpenAI Gym.☆19Jan 28, 2018Updated 8 years ago