jeffasante/grpo-maze-solver

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jeffasante/grpo-maze-solver)

jeffasante / grpo-maze-solver

A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).

☆12

Alternatives and similar repositories for grpo-maze-solver

Users that are interested in grpo-maze-solver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bnelo12 / PPO-Implemnetation
View on GitHub
Implementation of PPO for CartPole-v1
☆10Jan 1, 2019Updated 7 years ago
superlinear-ai / microGRPO
View on GitHub
🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper
☆43Jun 28, 2025Updated last year
kedar49 / Snake-Apple
View on GitHub
Snake's Food Hunt" is a competitive AI-driven game where two snakes learn to navigate, collect food, and avoid collisions using Deep Q-Le…
☆10Nov 18, 2025Updated 8 months ago
Vlsir / Hdl21Schematics
View on GitHub
Hdl21 Schematics
☆17Jan 24, 2024Updated 2 years ago
Gitnoter / Gitnoter
View on GitHub
基于`Git`仓库存储的`Markdown`笔记应用
☆22Nov 28, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CORE-Robotics-Lab / ICCT
View on GitHub
☆18Jun 26, 2026Updated 3 weeks ago
YanzhaoShi / HSENet
View on GitHub
The official code and model of HSENet: Hybrid Spatial Encoding Network for 3D Medical Vision-Language Understanding.
☆15Sep 19, 2025Updated 10 months ago
CORE-Robotics-Lab / Interpretable_DDTS_AISTATS2020
View on GitHub
Public code for implementation and experiments with differentiable decision trees.
☆32Oct 17, 2024Updated last year
americast / DRL_HVAC
View on GitHub
Optimising electricity expenditure in an HVAC system under dynamic electricity pricing scheme and weather conditions using a DDPG model.
☆26Feb 6, 2022Updated 4 years ago
alva-ai / skills
View on GitHub
Build and deploy agentic finance applications on the Alva platform. Access 250+ financial data sources, run cloud-side analytics, backtes…
☆33Updated this week
arsfutura / smart-lock
View on GitHub
Face Recognition Door Lock
☆16Nov 22, 2022Updated 3 years ago
lijianwen1997 / Synergistic-Reinforcement-and-Imitation-Learning
View on GitHub
Vision-driven Autonomous Flight of UAV Along River Using Deep Reinforcement Learning with Dynamic Expert Guidance
☆15Mar 8, 2025Updated last year
quantumiracle / Cascading-Decision-Tree
View on GitHub
Open-source code for paper CDT: Cascading Decision Trees for Explainable Reinforcement Learning
☆41Oct 31, 2025Updated 8 months ago
freemansoft / Network-intrusion-dataset-creator
View on GitHub
☆12Nov 12, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Derek-TH-Wang / OpenRoboRL
View on GitHub
An open source robot reinforcement learing plantform using stable-baselines and OpenAI Gym
☆10Mar 24, 2023Updated 3 years ago
GSR-SQL / GSR
View on GitHub
LLM Prompting for Text2SQL via Gradual SQL Reffnement
☆15Feb 19, 2025Updated last year
Tarekshohdy688 / Mobile_Macnum_Robot
View on GitHub
4WD Mecanum Mobile Robot ROS 1&2 Ready
☆19Jun 6, 2024Updated 2 years ago
poudel-bibek / Urban-Control
View on GitHub
Joint Pedestrian and Vehicle Traffic Optimization in Urban Environments using Reinforcement Learning
☆17Sep 23, 2025Updated 9 months ago
reachtarunhere / nndl
View on GitHub
Solutions to neuralnetworksanddeeplearning.com
☆14Dec 21, 2016Updated 9 years ago
guang1997 / car_rent
View on GitHub
汽车出租小项目，使用ssm框架以及layui
☆12Dec 16, 2022Updated 3 years ago
coderspage / flask-sse
View on GitHub
☆15Dec 29, 2020Updated 5 years ago
AnhaoZhao-LLMer / A_Dynamic_Multi-Modal_Deep_Reinforcement_Learning_Framework_for_3D_Bin_Packing_Problem
View on GitHub
the Pytorch implementation of A Dynamic Multi-Modal Deep Reinforcement Learning Framework for 3D Bin Packing Problem
☆11Sep 6, 2025Updated 10 months ago
salesforce / sibling-rivalry
View on GitHub
Code for Sibling Rivalry and experiments presented in associated paper
☆18May 1, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
willer-lu / nlp-llm-interview
View on GitHub
此项目创建的初衷是为了帮助人工智能、自然语言处理和大语言模型相关背景的同学找工作使用，欢迎加入项目的建设和维护
☆18Mar 30, 2025Updated last year
HansdasC / Escape
View on GitHub
基于 Android Studio 与 Java 的 Android 端游戏应用，是一个结合 RPG 与 GalGame 模式的解密攻略类游戏，包含背包系统、地图系统、交易系统、存档系统等。
☆21Mar 11, 2024Updated 2 years ago
baidu / speech-samples
View on GitHub
百度语音示例
☆50Feb 28, 2018Updated 8 years ago
chmodsss / noizeus_corpora
View on GitHub
Speech corpora for the speech recognition evaluation system
☆21Mar 20, 2018Updated 8 years ago
fords / Systems-Modeling
View on GitHub
Systems Modeling. Learn a variety of systems, such as those involving mechanical, electrical, hydraulic, pneumatic systems, and mixtures …
☆16Dec 20, 2017Updated 8 years ago
dqxiu / KAssess
View on GitHub
☆14Oct 28, 2023Updated 2 years ago
Geo3ngel / JSON-Similarity-comparitor
View on GitHub
A command line tool for comparing JSON files by degree of similarity.
☆12Oct 28, 2019Updated 6 years ago
iit-DLSLab / croSTAta
View on GitHub
Cross-State Transition Attention Transformer for improved robotic manipulation with better temporal modeling; https://arxiv.org/abs/2510.…
☆19Mar 8, 2026Updated 4 months ago
VeloDC / oshot_detection
View on GitHub
One-Shot Unsupervised Cross Domain Detection
☆13Nov 22, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
RoboEden / flatland-marl
View on GitHub
A multi-agent reinforcement learning solution to Flatland3 challenge.
☆18Feb 16, 2024Updated 2 years ago
symbench / spice-datasets
View on GitHub
SPICE Netlist Datasets: https://symbench.github.io/spice-datasets/
☆40Oct 10, 2023Updated 2 years ago
Skielex / slgbuilder
View on GitHub
A Python package for building and cutting sparse layered s-t graphs.
☆13Nov 6, 2023Updated 2 years ago
MaheepChaudhary / SAE-Ravel
View on GitHub
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13Jan 26, 2025Updated last year
SZU-AdvTech-2024 / 059-Deep-Reinforcement-Learning-for-Task-Offloading-in-Mobile-Edge-Computing-Systems
View on GitHub
☆12Jan 9, 2025Updated last year
twni2016 / f-IRL
View on GitHub
Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020
☆45Jul 19, 2023Updated 3 years ago
aaksham / frozenlake
View on GitHub
Value & Policy Iteration for the frozenlake environment of OpenAI
☆15May 14, 2019Updated 7 years ago