jeffasante / grpo-maze-solver
A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).
☆12Feb 9, 2025Updated last year
Alternatives and similar repositories for grpo-maze-solver
Users who are interested in grpo-maze-solver are comparing it to the repositories listed below
- "Snake's Food Hunt" is a competitive AI-driven game where two snakes learn to navigate, collect food, and avoid collisions using Deep Q-Le…☆10Nov 18, 2025Updated 2 months ago
- A floating offshore wind farm simulation and flow control framework using FLORIS, MoorPy, and deep reinforcement learning☆19Jan 28, 2026Updated 2 weeks ago
- 🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper☆39Jun 28, 2025Updated 7 months ago
- A command line tool for comparing JSON files by degree of similarity.☆11Oct 28, 2019Updated 6 years ago
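- This project was created to help students with backgrounds in artificial intelligence, natural language processing, and large language models find jobs; contributions to building and maintaining the project are welcome☆15Mar 30, 2025Updated 10 months ago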
- Implementation of PPO for CartPole-v1☆10Jan 1, 2019Updated 7 years ago
- Autonomous UAV navigation using Deep Reinforcement Learning (DQN). The UAV learns to efficiently navigate grid-based environments, avoid …☆13Feb 11, 2025Updated last year
- Vision-driven Autonomous Flight of UAV Along River Using Deep Reinforcement Learning with Dynamic Expert Guidance☆13Mar 8, 2025Updated 11 months ago
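- An automation tool based on GitHub Actions that tracks and analyzes the latest arXiv papers every morning and sends an analysis report by email. The tool uses DeepSeek AI to analyze and summarize the papers.☆21Jun 20, 2025Updated 7 months ago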
- A small car rental project built with the SSM framework and layui☆12Dec 16, 2022Updated 3 years ago
- The codebase for Inducing Causal Structure for Interpretable Neural Networks☆11Dec 3, 2021Updated 4 years ago
- ICASSP 2025☆16Jun 23, 2025Updated 7 months ago
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆48Feb 5, 2026Updated last week
- ☆13Mar 22, 2023Updated 2 years ago
- Evaluate language models using multiple choice items☆13Jan 15, 2026Updated 3 weeks ago
- One-Shot Unsupervised Cross Domain Detection☆13Nov 22, 2022Updated 3 years ago
- ☆12Nov 12, 2022Updated 3 years ago
- A Python package for building and cutting sparse layered s-t graphs.☆13Nov 6, 2023Updated 2 years ago
- An open source robot reinforcement learning platform using stable-baselines and OpenAI Gym☆10Mar 24, 2023Updated 2 years ago
- Llama-style transformer in PyTorch with multi-node / multi-GPU training. Includes pretraining, fine-tuning, DPO, LoRA, and knowledge dist…☆21Feb 5, 2026Updated last week
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆12Jan 26, 2025Updated last year
- 4WD Mecanum Mobile Robot ROS 1&2 Ready☆15Jun 6, 2024Updated last year
- LLM Prompting for Text2SQL via Gradual SQL Refinement☆15Feb 19, 2025Updated 11 months ago
- Hdl21 Schematics☆17Jan 24, 2024Updated 2 years ago
- ☆11Jan 9, 2025Updated last year
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆23Dec 14, 2025Updated last month
- ☆14Oct 28, 2023Updated 2 years ago
- DepthNav is a research framework for developing and evaluating autonomous navigation policies, particularly for aerial robots in complex …☆28Nov 17, 2025Updated 2 months ago
- Face Recognition Door Lock☆16Nov 22, 2022Updated 3 years ago
- ☆17Feb 26, 2024Updated last year
- Official implementation of depth from focus using the ring difference filter (RDF)☆16Jan 21, 2020Updated 6 years ago
- Systems Modeling. Learn a variety of systems, such as those involving mechanical, electrical, hydraulic, pneumatic systems, and mixtures …☆16Dec 20, 2017Updated 8 years ago
- QPBO interface and alpha expansion for Python☆24Nov 3, 2022Updated 3 years ago
- Solutions to neuralnetworksanddeeplearning.com☆14Dec 21, 2016Updated 9 years ago
- This project aims to develop a UAV system that can autonomously land on a moving platform using computer vision and control systems. The …☆21Feb 2, 2026Updated last week
- This repository contains a Genetic Algorithm (GA) implementation for solving the Traveling Salesman Problem (TSP).☆16Mar 23, 2025Updated 10 months ago
- ☆15Dec 29, 2020Updated 5 years ago
- This repo performs MNBVC text quality classification using fastText☆16Oct 2, 2023Updated 2 years ago
- ☆28Jun 12, 2025Updated 8 months ago