Kotlin implementation of algorithms, examples, and exercises from Sutton and Barto's Reinforcement Learning: An Introduction (2nd Edition)
☆41Apr 18, 2021Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning-An-Introduction
Users who are interested in Reinforcement-Learning-An-Introduction are comparing it to the libraries listed below.
- Naive Bayes Tweet Sentiment Classifier in Kotlin☆14Sep 21, 2020Updated 5 years ago
- Sparse Boolean linear algebra for Nvidia Cuda, OpenCL and CPU computations☆16Aug 19, 2022Updated 3 years ago
- [AAAI 2018] Implementation of the Ethics Shaping approach proposed in "A low-cost ethics shaping approach for designing reinforcement lea…☆11Aug 3, 2018Updated 7 years ago
- A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.☆11May 29, 2023Updated 3 years ago
- [Android] Samsung Knox Standard activation helper library for Android☆12Jun 14, 2019Updated 6 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆16Jan 22, 2019Updated 7 years ago
- ☆20Aug 16, 2020Updated 5 years ago
- ☆11Apr 4, 2022Updated 4 years ago
- Tensors, Model Inference, Linear Regression, MNIST, LSTM with TensorFlow, DL4j, komputation, DJL on Kotlin☆17Dec 18, 2020Updated 5 years ago
- A reporting project on the performance of self-optimizing interpreters☆16Dec 5, 2015Updated 10 years ago
- The evaluation code for the paper "Radar Aided Proactive Blockage Prediction in Real-World Millimeter Wave Systems".☆10Apr 21, 2025Updated last year
- Implementation of deep reinforcement learning for optimizing the beams and predicting the blockage events☆17Nov 8, 2020Updated 5 years ago
- ☆35Sep 27, 2018Updated 7 years ago
- DQN, DDQN, and Policy Gradient Algorithm-Based Antenna Selection Schemes in MIMO Systems.☆11Dec 17, 2023Updated 2 years ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆76Sep 17, 2024Updated last year
- Some simulations for wireless RL☆12May 7, 2023Updated 3 years ago
- Reinforcement Learning Environments for Omniverse Isaac Gym☆10May 9, 2023Updated 3 years ago
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models☆12Feb 17, 2020Updated 6 years ago
- This repository contains the simulation source code for implementing reinforcement learning algorithms for autonomous navigation of ardon…☆11Mar 20, 2021Updated 5 years ago
- Deep Reinforcement Learning (RL) Using Python☆35Apr 20, 2024Updated 2 years ago
- Overview of Clone Detection Tools for Java☆14Aug 23, 2025Updated 9 months ago
- An Android Instrumentation tool to compute Code Coverage☆19Apr 28, 2026Updated last month
- paper code☆11Jul 25, 2022Updated 3 years ago
- LiFi Visible Light Positioning☆16Dec 16, 2018Updated 7 years ago
- Deep Q network-based power allocation for multi-cell massive MIMO cellular network.☆21Apr 11, 2026Updated 2 months ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …☆12Mar 29, 2019Updated 7 years ago
- ☆14Jun 20, 2023Updated 2 years ago
- Policy learning of in-hand manipulation. Proximal policy optimization trains the Allegro hand to learn a stabilizing grasp☆14Feb 5, 2024Updated 2 years ago
- ☆11Mar 31, 2020Updated 6 years ago
- Sample audio and video files for the YouTube Video Tutorials on HTML5 Audio and Video☆16Mar 4, 2021Updated 5 years ago
- Promises and Java8 / RXJava like streaming for Xtend☆15Feb 16, 2018Updated 8 years ago
- Natural Language or Not☆11Jun 20, 2022Updated 3 years ago
- Deep Reinforcement Learning and BCD to solve phase shift and resource allocation of RIS and RSU☆32Jan 18, 2021Updated 5 years ago
- The com.ostermiller.util package is open source (GPL) Java utilities maintained by Stephen Ostermiller.☆13Jan 27, 2023Updated 3 years ago
- A parallelized implementation of optimized MSD radix sort for strings☆27Oct 11, 2018Updated 7 years ago
- Multi-Agent Determinantal Q-Learning☆43Nov 22, 2022Updated 3 years ago
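- A Python 3 model for recognizing text in images of business registration information. The main steps include denoising, watermark separation, color inversion, and binarization enhancement; batch image processing is supported.☆13Mar 14, 2020Updated 6 years ago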
- Contact-Aware Symplectic Integrator Network☆17Mar 22, 2023Updated 3 years ago
- Multiagent gridworld for the TEAM project based on gym-minigrid☆12Nov 27, 2019Updated 6 years ago