A reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" in TensorFlow
☆15 · Apr 27, 2018 · Updated 7 years ago
Alternatives and similar repositories for Tensorflow-DeepMind-Atari-Deep-Q-Learner-2Player
Users who are interested in Tensorflow-DeepMind-Atari-Deep-Q-Learner-2Player are comparing it to the libraries listed below
- Multiagent Cooperation and Competition with Deep Reinforcement Learning ☆123 · Nov 26, 2015 · Updated 10 years ago
- ☆30 · Aug 20, 2021 · Updated 4 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning ☆35 · May 14, 2019 · Updated 6 years ago
- Collaborative Deep Reinforcement Learning ☆32 · Jul 29, 2017 · Updated 8 years ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network ☆10 · Jan 26, 2020 · Updated 6 years ago
- ☆14 · Aug 12, 2024 · Updated last year
- The Chemical Reaction Optimization (CRO) algorithm with dependent classes in Python 3. ☆11 · Apr 21, 2020 · Updated 5 years ago
- Part of a research scholarship. I built a basic 2D driving sim with simulated lidar data to train a Deep Q Neural Network. So far after abo… ☆11 · Feb 15, 2017 · Updated 9 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022) ☆10 · Dec 1, 2022 · Updated 3 years ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient ☆10 · Feb 1, 2019 · Updated 7 years ago
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg ☆10 · Feb 10, 2015 · Updated 11 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer" ☆13 · Dec 26, 2017 · Updated 8 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data ☆10 · Aug 12, 2022 · Updated 3 years ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks" ☆24 · Jun 8, 2025 · Updated 9 months ago
- Code for "Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies" ☆11 · Jun 27, 2025 · Updated 8 months ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation ☆11 · Jul 26, 2016 · Updated 9 years ago
- ☆10 · Sep 20, 2018 · Updated 7 years ago
- ☆10 · Nov 27, 2019 · Updated 6 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning" ☆11 · Oct 3, 2023 · Updated 2 years ago
- Official implementation for the paper "Sample-Then-Optimize Batch Neural Thompson Sampling", published at NeurIPS 2022. ☆10 · Oct 13, 2022 · Updated 3 years ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation" ☆13 · Apr 19, 2023 · Updated 2 years ago
- COSE: Configuring Serverless Functions using Statistical Learning ☆10 · Jun 28, 2023 · Updated 2 years ago
- This is for visual servoing of a TurtleBot combined with navigation management ☆13 · Feb 11, 2019 · Updated 7 years ago
- Task models for human robot collaboration ☆12 · Jul 17, 2018 · Updated 7 years ago
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize" ☆10 · Mar 22, 2023 · Updated 2 years ago
- Multi-view Broad Learning System ☆10 · Mar 20, 2022 · Updated 3 years ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning ☆11 · Oct 18, 2022 · Updated 3 years ago
- Google AI Research ☆10 · Mar 11, 2020 · Updated 5 years ago
- Low-level autonomous control and tracking of quadrotor using reinforcement learning - Proximal Policy Optimization ☆11 · Dec 2, 2020 · Updated 5 years ago
- The source code of the paper "Compressed Federated Learning Based on Adaptive Local Differential Privacy". ☆10 · Oct 23, 2023 · Updated 2 years ago
- ☆11 · May 27, 2022 · Updated 3 years ago
- ZJU Robotics project of differential drive car path planning and trajectory planning based on the Client simulation platform (my freshman… ☆10 · Dec 2, 2020 · Updated 5 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic (A3C/A2C), DeepMind ☆10 · Jan 9, 2018 · Updated 8 years ago
- I added self-play functionality to OpenAI gyms ☆10 · Jan 16, 2021 · Updated 5 years ago
- Code for Federated Generalized Bayesian Learning via Distributed Stein Variational Gradient Descent ☆10 · Nov 19, 2020 · Updated 5 years ago
- ☆10 · Jul 20, 2020 · Updated 5 years ago
- ardrone simulation in Gazebo (for Kinetic and Gazebo 7). Now it can run. ☆10 · Oct 27, 2017 · Updated 8 years ago
- Low-rank Tensor Based Proximity Learning for Multi-view Clustering, TKDE2022 ☆11 · Dec 31, 2021 · Updated 4 years ago
- Core placement optimization ☆13 · Dec 25, 2021 · Updated 4 years ago