testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI
☆11May 6, 2025Updated last year
Alternatives and similar repositories for snakeAI
Users who are interested in snakeAI are comparing it to the libraries listed below.
- A Long Short Term Memory neural network for time series prediction. Memory blocks contain one memory cell in each. Weights for the networ…☆15Sep 3, 2018Updated 7 years ago
- Blazingly Fast Implementation of Deep Q-Network in C++ with NNabla☆18Mar 3, 2020Updated 6 years ago
- Simple verification experiment code for multi-agent RL using the OpenAI MPE environment☆34Jun 22, 2022Updated 3 years ago
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa☆11Jan 28, 2020Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- A MATLAB simple interactive Reinforcement Learning environment for Evolutionary Neural Network-based car with a proximity sensor☆14Apr 11, 2019Updated 7 years ago
- Here is our algorithm for Pursuit Problem based on the Distributed Reinforcement Learning for Cooperative Multi-robot Pursuit☆10Apr 17, 2019Updated 7 years ago
- A public dataset for Chinese chatbots☆11Sep 11, 2019Updated 6 years ago
- KiCAD plugin written in Python for programmatically placing clusters of components onto a PCB from a layout file.☆10Jun 30, 2021Updated 4 years ago
- VLSI placement and routing tool☆17Dec 20, 2025Updated 5 months ago
- This is the official code base for the paper "Think Before You Act: Decision Transformers with Internal Working Memory"☆23Jul 12, 2024Updated last year
- Deep Reinforcement Learning for Robotic Pushing and Picking in Cluttered Environment☆19Jun 2, 2020Updated 5 years ago
- Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…☆20Jun 17, 2020Updated 5 years ago
- ☆20Mar 1, 2019Updated 7 years ago
- This is a project of Blood Pressure Meter on STM32 F446RE☆17Aug 12, 2019Updated 6 years ago
- Code for paper "JMDC: A Joint Model and Data Compression System for Deep Neural Networks Collaborative Computing in Edge-Cloud Networks"☆25Aug 24, 2025Updated 8 months ago
- ☆14Oct 23, 2018Updated 7 years ago
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"☆10Mar 22, 2023Updated 3 years ago
- In this work, we propose a novel formulation titled Federated Deep Q Networks (F-DQN) to perform distributed learning for Deep RL algorit…☆21Dec 25, 2020Updated 5 years ago
- Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.☆21Feb 13, 2023Updated 3 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago
- Long Short-Term Memory implementation by c++☆29Aug 6, 2018Updated 7 years ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- ardrone simulation in Gazebo (for Kinetic and Gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data☆10Aug 12, 2022Updated 3 years ago
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg☆10Feb 10, 2015Updated 11 years ago
- ZJU Robotics project of differential drive car path planning and trajectory planning based on the Client simulation platform (my freshman…☆10Dec 2, 2020Updated 5 years ago
- Google AI Research☆10Mar 11, 2020Updated 6 years ago
- Official implementation of MacroRank: Ranking Macro Placement Solutions Leveraging Translation Equivariancy (ASP-DAC 2023)☆18Jun 3, 2023Updated 2 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- this is for visual servoing of a turtlebot combined with navigation management☆13Feb 11, 2019Updated 7 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆21May 26, 2021Updated 4 years ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient☆10Feb 1, 2019Updated 7 years ago
- ☆17Oct 12, 2023Updated 2 years ago