PyTorch implementation of the Normalized Advantage Function (NAF) Q-learning algorithm for continuous control problems, with Prioritized Experience Replay (PER) and n-step returns
☆28Feb 16, 2021Updated 5 years ago
Alternatives and similar repositories for Normalized-Advantage-Function-NAF-
Users who are interested in Normalized-Advantage-Function-NAF- are comparing it to the libraries listed below
- RoboCup challenge implementations☆44Feb 4, 2026Updated last month
- PyTorch implementation of AREL☆16Dec 20, 2021Updated 4 years ago
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.☆28May 29, 2025Updated 9 months ago
- Solving several OpenAI Gym and custom gazebo environments using reinforcement learning techniques.☆19Jan 20, 2019Updated 7 years ago
- [NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning☆27Sep 25, 2024Updated last year
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆37Jun 14, 2024Updated last year
- Solve Multi-agent Path Finding problem for heterogeneous robots.☆31Feb 24, 2021Updated 5 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- ☆32Apr 25, 2021Updated 4 years ago
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆33Mar 16, 2020Updated 5 years ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆42Oct 14, 2023Updated 2 years ago
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- ☆14Aug 12, 2024Updated last year
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient☆10Feb 1, 2019Updated 7 years ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg☆10Feb 10, 2015Updated 11 years ago
- Official implementation for the UOF paper (algorithm & environment)☆33Jun 15, 2023Updated 2 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data☆10Aug 12, 2022Updated 3 years ago
- Experimental Bmad Code transcribed in Python with Numba and Pytorch☆14Jun 21, 2024Updated last year
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
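- IndoorEnvironmentMonitoringSystem is a wireless monitoring system for indoor building environment quality, built on IoT, artificial intelligence, and BIM. Sensors wirelessly collect building environment data in real time and upload it to a host computer; a deep-learning LSTM model rates the environmental quality level, and the result is displayed in real time in the BIM model and on a web page. This design…☆10Jul 10, 2023Updated 2 years ago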
- Custom PD controller and robot emulation for Franka-Emika Panda arms☆10Aug 25, 2020Updated 5 years ago
- ROS wrapper for SMAC, a versatile tool for optimizing algorithm parameters☆11Jul 19, 2021Updated 4 years ago
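- whale is a simplified version of the Kaggle [humpback whale image](https://www.kaggle.com/c/humpback-whale-identification/) identification competition; it reproduces the algorithm of pudae, a Top 3 finisher☆10Jun 17, 2023Updated 2 years ago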
- The source code of the paper "Compressed Federated Learning Based on Adaptive Local Differential Privacy".☆10Oct 23, 2023Updated 2 years ago
- Low-rank Tensor Based Proximity Learning for Multi-view Clustering, TKDE2022☆11Dec 31, 2021Updated 4 years ago
- ☆11May 27, 2022Updated 3 years ago
- core placement optimization☆13Dec 25, 2021Updated 4 years ago
- Low-level autonomous control and tracking of quadrotor using reinforcement learning - Proximal Policy Optimization☆11Dec 2, 2020Updated 5 years ago
- Designing an optimized path for multiple robots in a warehouse for picking and delivery operations using A* algorithm (shortest path) and…☆11Jul 28, 2023Updated 2 years ago
- My personal LaTeX class for taking notes.☆10Aug 27, 2022Updated 3 years ago
- Official implementation for the paper "Sample-Then-Optimize Batch Neural Thompson Sampling", published at NeurIPS 2022.☆10Oct 13, 2022Updated 3 years ago
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.☆12Jan 15, 2020Updated 6 years ago
- Benchmark codebase for 2D range finder based people detectors using the FROG dataset☆12Oct 20, 2025Updated 4 months ago
- Multi-layer perceptron, Autoencoder, and Restricted Boltzmann Machine☆10Sep 15, 2018Updated 7 years ago
- ardrone simulation in Gazebo (for Kinetic and Gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago