testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI
☆11 · Jun 24, 2026 · Updated last week
Alternatives and similar repositories for snakeAI
Users that are interested in snakeAI are comparing it to the libraries listed below.
- A Long Short Term Memory neural network for time series prediction. Memory blocks contain one memory cell in each. Weights for the networ… ☆15 · Sep 3, 2018 · Updated 7 years ago
- Blazingly Fast Implementation of Deep Q-Network in C++ with NNabla ☆18 · Mar 3, 2020 · Updated 6 years ago
- Simple verification experiments codes for multi-agent RL using OpenAI MPE environment ☆35 · Jun 22, 2022 · Updated 4 years ago
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa ☆11 · Jan 28, 2020 · Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 7 years ago
- A MATLAB simple interactive Reinforcement Learning environment for Evolutionary Neural Network-based car with a proximity sensor ☆14 · Apr 11, 2019 · Updated 7 years ago
- Here is our algorithm for Pursuit Problem based on the Distributed Reinforcement Learning for Cooperative Multi-robot Pursuit ☆10 · Apr 17, 2019 · Updated 7 years ago
- Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee… ☆20 · Jun 17, 2020 · Updated 6 years ago
- ☆20 · Mar 1, 2019 · Updated 7 years ago
- Code for paper "JMDC: A Joint Model and Data Compression System for Deep Neural Networks Collaborative Computing in Edge-Cloud Networks" ☆25 · Jun 17, 2026 · Updated 2 weeks ago
- Artificial Neural Networks C++ library providing implementation of some common architectures of neural networks. ☆24 · May 18, 2020 · Updated 6 years ago
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize" ☆10 · Mar 22, 2023 · Updated 3 years ago
- In this work, we propose a novel formulation titled Federated Deep Q Networks (F-DQN) to perform distributed learning for Deep RL algorit… ☆21 · Dec 25, 2020 · Updated 5 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks" ☆15 · Jul 28, 2021 · Updated 4 years ago
- Long Short-Term Memory implementation by c++ ☆29 · Aug 6, 2018 · Updated 7 years ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network ☆10 · Jan 26, 2020 · Updated 6 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo… ☆11 · Feb 15, 2017 · Updated 9 years ago
- ardrone simulation in gazebo(for kinetic and gazebo 7). Now it can run. ☆10 · Oct 27, 2017 · Updated 8 years ago
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg ☆10 · Feb 10, 2015 · Updated 11 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II ☆13 · May 16, 2019 · Updated 7 years ago
- Google AI Research ☆10 · Mar 11, 2020 · Updated 6 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022) ☆10 · Dec 1, 2022 · Updated 3 years ago
- ☆11 · Oct 19, 2020 · Updated 5 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer" ☆13 · Dec 26, 2017 · Updated 8 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021) ☆12 · Oct 22, 2021 · Updated 4 years ago
- ☆10 · Jul 20, 2020 · Updated 5 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch ☆21 · May 26, 2021 · Updated 5 years ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient ☆10 · Feb 1, 2019 · Updated 7 years ago
- ☆17 · Oct 12, 2023 · Updated 2 years ago
- Interface definitions for the Compute@Edge platform in witx. ☆15 · Feb 11, 2022 · Updated 4 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning" ☆11 · Oct 3, 2023 · Updated 2 years ago
- Implementation of Multi-Agent Object Impedance Controller ☆10 · Sep 14, 2021 · Updated 4 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24) ☆17 · Feb 10, 2024 · Updated 2 years ago
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712. ☆12 · Jan 15, 2020 · Updated 6 years ago
- Code for Federated Generalized Bayesian Learning via Distributed Stein Variational Gradient Descent ☆10 · Nov 19, 2020 · Updated 5 years ago
- Dynamic Simulation Environments for Reinforcement Learning ☆13 · Apr 17, 2021 · Updated 5 years ago
- ☆12 · Jan 30, 2021 · Updated 5 years ago
- Quadruped Robot controller design and simulation on Webots ☆12 · Apr 28, 2020 · Updated 6 years ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆13 · Jun 6, 2025 · Updated last year