Implementation of Deep Q-learning to solve random mazes.
☆20Jun 17, 2021Updated 4 years ago
Alternatives and similar repositories for deep_Q_learning_maze
Users that are interested in deep_Q_learning_maze are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Apr 20, 2021Updated 4 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld☆13Jul 13, 2020Updated 5 years ago
- ☆11Jan 23, 2017Updated 9 years ago
- Code to reproduce results from the paper: Prediction and Control in Continual Reinforcement Learning, NeurIPS 2023.☆13May 10, 2024Updated last year
- A Large-Scale Dataset for Stereo Matching in Autonomous Driving Scenarios☆10Oct 25, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs☆21Nov 1, 2025Updated 5 months ago
- Simulation of Ridesharing Market and the MDP Order Dispatch Policy☆21Mar 13, 2024Updated 2 years ago
- Bitonic sort using simd (avx/neon) instructions☆17Mar 14, 2022Updated 4 years ago
- Implementation of a multi-agent planning algorithm based on potential field method with a distributed feedback protocol. (Summer 2018)☆12Mar 31, 2019Updated 7 years ago
- Exact Verification of ReLU Neural Control Barrier Functions☆11Oct 13, 2023Updated 2 years ago
- "SuperstarGAN: Generative adversarial networks for image-to-image translation in large-scale domains" in Neural Networks (Volume 162, May…☆13Mar 30, 2023Updated 3 years ago
- An AIoT project based on PYNQ-Z2 FPGA Evaluation board. Reading image from usb camera and running yolov3-tiny detection with DPU and usin…☆11May 12, 2022Updated 3 years ago
- A gentle introduction to Isabelle and Isabelle/HOL☆19Mar 27, 2025Updated last year
- Pytorch implementation of "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labeled Nodes"☆18Jul 8, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- a project build the SSD net in pynq-z2☆15Aug 1, 2020Updated 5 years ago
- A fullstack Project management app developed with NextJs v13 using experimental appDir☆14Feb 12, 2023Updated 3 years ago
- Based on the thesis "Consensus-Based Decentralized Auctions for Robust Task Allocation", the realization of consensus-based auction algor…☆17Feb 28, 2024Updated 2 years ago
- ☆32Mar 19, 2024Updated 2 years ago
- Parallel sorting algorithm implementation in OpenMP and MPI☆19Apr 5, 2020Updated 6 years ago
- Deployment of Deep learning Image Super-Resolution Models in Xilinx Zynq MPSoC ZCU102☆19May 30, 2020Updated 5 years ago
- ☆22Feb 1, 2024Updated 2 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 6 years ago
- Source code of “Noah: Neural-optimized A* Search Algorithm for Graph Edit Distance Computation”, accepted by ICDE 2021. Authors: Lei Yang…☆21Aug 26, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆30Dec 22, 2024Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- ☆23May 30, 2018Updated 7 years ago
- Use nodejs to read serial data from the USB Port of your computer and send it to a front end web application using websockets☆22Mar 22, 2020Updated 6 years ago
- Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design☆40Jun 14, 2021Updated 4 years ago
- Related papers for Continual Reinforcement Learning.☆43Feb 8, 2026Updated 2 months ago
- Temporal-Hierarchical Memory Consolidation for Long-Horizon Conversational Agents☆102Apr 7, 2026Updated last week
- ROS package for autonomous navigation of AGVs in unknown cluttered environments using U-MPPI☆46Jan 31, 2025Updated last year
- Extremely fast non-cryptographic hash algorithm☆10Oct 9, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is a test case repository for mapf_ros package☆28May 8, 2025Updated 11 months ago
- For C#:Half-precision floating-point format.☆12Sep 26, 2019Updated 6 years ago
- (Unity) A Material Property Block sample project for a tutorial video☆10Jan 5, 2023Updated 3 years ago
- Hierarchical Online Planning and Reinforcement Learning on Taxi☆32Oct 23, 2017Updated 8 years ago
- Implementation of robust adaptive control methods for the linear quadratic regulator☆36Dec 13, 2021Updated 4 years ago
- Integration of two camera 📷 modules to Basys 3 FPGA☆45Feb 8, 2023Updated 3 years ago
- Pynq projects and guides☆29Sep 11, 2018Updated 7 years ago