A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow
☆15Apr 27, 2018Updated 8 years ago
Alternatives and similar repositories for Tensorflow-DeepMind-Atari-Deep-Q-Learner-2Player
Users that are interested in Tensorflow-DeepMind-Atari-Deep-Q-Learner-2Player are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆123Nov 26, 2015Updated 10 years ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation☆11Jul 26, 2016Updated 9 years ago
- ☆13Nov 17, 2015Updated 10 years ago
- ☆13Apr 3, 2019Updated 7 years ago
- ☆10Nov 27, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Research project - real-time multi-agent pursuit a moving target☆17Mar 13, 2021Updated 5 years ago
- Collaborative Deep Reinforcement Learning☆32Jul 29, 2017Updated 8 years ago
- “华为杯”第十六届中国研究生数学建模F题 国家一等奖☆13Jul 7, 2020Updated 5 years ago
- ☆11Aug 23, 2020Updated 5 years ago
- Simulator for evaluating cloud/edge requests from connected vehicles and computing statistical analysis of the input network☆12May 7, 2018Updated 8 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 7 years ago
- Tensorflow implementation of DQN to control cart-pole from OpenAI gym environment☆14Sep 24, 2017Updated 8 years ago
- Multiagent reinforcement learning simulation framework - Undergraduate thesis in Mechatronics Engineering at the University of Brasília☆69Sep 30, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- Reinforcement Task Scheduling Project☆16Jun 3, 2019Updated 7 years ago
- 2019华为杯研究生数学建模比赛F题(二等奖)☆15Nov 11, 2019Updated 6 years ago
- The Chemical Reaction Optimization (CRO) algorithm with dependent classes in python 3.☆11Apr 21, 2020Updated 6 years ago
- 2019华为杯 第十六届研究生数学建模F题解决方案“基于最小代价与Dubins曲线的改进Dijkstra与A*算法航迹规划“源代码☆16Jan 16, 2020Updated 6 years ago
- ☆10Sep 20, 2018Updated 7 years ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆26May 21, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 面试必备基础知识☆12Mar 21, 2019Updated 7 years ago
- I added selfplay functionality to openai gyms☆10Jan 16, 2021Updated 5 years ago
- Multi-Graph Multi-Label Learning☆11May 30, 2018Updated 8 years ago
- 百度UIE抽取模型torch版训练预测框架☆12Nov 20, 2024Updated last year
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- suPER is a collaborative multi-agent RL algorithm☆14Jun 11, 2024Updated 2 years ago
- simple keras implement for 《Memory Fusion Network for Multi-view Sequential Learning》☆14Apr 9, 2021Updated 5 years ago
- ☆10Jul 1, 2019Updated 6 years ago
- Continuous Energy Minimization for Multitarget Tracking☆20Feb 9, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆15Feb 8, 2023Updated 3 years ago
- ☆11Apr 12, 2020Updated 6 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 7 years ago
- a q-learning algorithms on packet routing.☆14Dec 1, 2018Updated 7 years ago
- COSE: Configuring Serverless Functions using Statistical Learning☆10Jun 28, 2023Updated 2 years ago
- ☆15Dec 13, 2022Updated 3 years ago
- Java tool to translate VRP instances to VRP-REP unified format.☆11Nov 28, 2014Updated 11 years ago