Various ways to learn a computer to escape from a maze. From random walk to a simple neural network.
☆109May 20, 2022Updated 4 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze
Users that are interested in Reinforcement-Learning-Maze are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Deep Q-learning to solve random mazes.☆20Jun 17, 2021Updated 4 years ago
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆31Aug 19, 2019Updated 6 years ago
- Interactive visualization tool for multiple volumes, meshes and points based on VTK. The app can be controlled with Python scripts (optio…☆11Mar 13, 2022Updated 4 years ago
- Code and data for Feher da Silva, C., Hare, T.A. Humans primarily use model-based inference in the two-stage task. Nat Hum Behav (2020). …☆13Jul 25, 2023Updated 2 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ACO with RL algorithm integration (Q-learning) and k-means boosted genetic applied to TSP☆10Sep 21, 2019Updated 6 years ago
- An attempt at recreating DeepMind's implementation of Deep Q Learning on Atari Breakout using PyTorch☆13Jan 16, 2020Updated 6 years ago
- A PyTorch implementation of deep Q-learning for Atari games☆13Dec 4, 2018Updated 7 years ago
- RL Agent for Atari Game Pong☆11Aug 25, 2019Updated 6 years ago
- ☆13Jul 13, 2016Updated 9 years ago
- A customizable framework to create maze and gridworld environments☆271Apr 5, 2019Updated 7 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆21Jul 27, 2020Updated 5 years ago
- Implementation and evaluation of combinatorial auction protocols: VCG and Groves mechanism with submodular approximation (GM-SMA)☆26Jan 7, 2023Updated 3 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆375Oct 9, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- human-readable solution of an arbitrary optimization problem through Reinforcement Learning☆12May 19, 2026Updated 3 weeks ago
- Implementation of QRL☆32Jun 22, 2019Updated 6 years ago
- A Julia IO type that facilitates width-limited printing☆12Mar 21, 2023Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- PennyLane/PyTorch implementation of Quantum agents in the Gym: a variational quantum algorithm for deep Q-learning (Skolik et al., 2021)☆39Mar 15, 2023Updated 3 years ago
- I added selfplay functionality to openai gyms☆10Jan 16, 2021Updated 5 years ago
- An implementation of Maximum Entropy model☆14Apr 28, 2012Updated 14 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- sketch-rnn demo for seoul mediacity biennale 2018☆13Sep 4, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environm…☆514Apr 25, 2022Updated 4 years ago
- ☆10Jul 1, 2019Updated 6 years ago
- ☆15Oct 8, 2024Updated last year
- A demo program to illustrate python code usage in unity☆14Jul 21, 2021Updated 4 years ago
- ☆11Feb 27, 2024Updated 2 years ago
- pybullet_animations☆12Nov 13, 2017Updated 8 years ago
- Code for Deep Structured Mixtures of Gaussian Processes (DSMGPs)☆11Jan 27, 2022Updated 4 years ago
- [IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning☆18Jul 31, 2023Updated 2 years ago
- How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell☆10Apr 19, 2016Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reshape text☆15Apr 21, 2022Updated 4 years ago
- Solve complex real-world problems by mastering reinforcement learning algorithms using OpenAI Gym and TensorFlow☆22Jan 30, 2023Updated 3 years ago
- Differential forms in Julia☆15Mar 16, 2024Updated 2 years ago
- Project for my graduate neural networks course - combining RL with VAEs☆22Nov 10, 2019Updated 6 years ago
- Prototype code for some Julia-OCaml bindings☆16Jan 3, 2021Updated 5 years ago
- 可视化量化机器学习论文关系的知识图谱系统☆25Dec 9, 2025Updated 6 months ago
- IterGANs: Iterative GANs for rotating visual objects☆13Aug 27, 2018Updated 7 years ago