Reinforcement learning in python
☆36Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for rl
Users that are interested in rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Q-Networks in tensorflow☆10Apr 4, 2017Updated 9 years ago
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆13Oct 25, 2023Updated 2 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Simple, small, fully-connected Python version of NeoRL☆11Jan 29, 2016Updated 10 years ago
- Source Cooperative Web Interface & API☆23May 22, 2026Updated last week
- MATLAB implementation of DQN for a navigation environment☆13Aug 13, 2020Updated 5 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆11Aug 11, 2016Updated 9 years ago
- This repository implements the calculation for 2 types of queues in the Queue Theory, namely, the M/M/c Queue and the M/M/c/c Queue.☆27Feb 4, 2017Updated 9 years ago
- Sample code for generative recurrent autoencoders.☆26Nov 12, 2016Updated 9 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Jun 14, 2018Updated 7 years ago
- DQN implemented in keras with Dueling Network and Prioritized Experience Replay☆16Nov 21, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- ☆27Dec 2, 2017Updated 8 years ago
- Deep reinforcement learning. In scikit-learn. In less than 50 effective lines.☆54Jan 23, 2017Updated 9 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- Code for a generative controller for the AI Gym cartpole task☆15Feb 22, 2017Updated 9 years ago
- ☆20Mar 1, 2019Updated 7 years ago
- L4DC2021 code repository☆14Apr 14, 2021Updated 5 years ago
- 11-785 Group Project: YouShen Poetry generation☆10Dec 23, 2020Updated 5 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of materials on AI guardrails☆53Jun 3, 2025Updated 11 months ago
- Support Sustainable Computing to provide customer with metrics for their carbon footprint workload☆14Mar 26, 2026Updated 2 months ago
- ☆23Dec 4, 2024Updated last year
- coding examples to Intro to RL☆13Apr 30, 2018Updated 8 years ago
- Adaptive Memory Prediction Framework☆15Apr 19, 2015Updated 11 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- ☆31Apr 2, 2022Updated 4 years ago
- Convert a Caffe Model to a Theano Model☆11Mar 30, 2015Updated 11 years ago
- Code from posts at AlgorthmicAlley.com☆14Nov 27, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Automatically exported from code.google.com/p/pyrbf☆11May 4, 2015Updated 11 years ago
- Code for Personalized Federated Learning with Gaussian Processes☆37Mar 3, 2022Updated 4 years ago
- TopoRhino☆12Feb 4, 2020Updated 6 years ago
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al.☆12Jan 14, 2017Updated 9 years ago
- The mechanoChemIGA code is an isogeometric analysis based code used to solve the partial differential equations describing solid mechanic…☆14Oct 15, 2020Updated 5 years ago
- ☆11May 5, 2026Updated 3 weeks ago
- Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks☆19Apr 11, 2017Updated 9 years ago