geek-ai / 1m-agentsView external linksLinks
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆70Sep 13, 2017Updated 8 years ago
Alternatives and similar repositories for 1m-agents
Users that are interested in 1m-agents are comparing it to the libraries listed below
Sorting:
- A Platform for Many-Agent Reinforcement Learning☆1,757Oct 22, 2022Updated 3 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆309Apr 13, 2023Updated 2 years ago
- Mind-aware Multi-agent Management Reinforcement Learning☆81Mar 6, 2019Updated 6 years ago
- ☆23Oct 7, 2018Updated 7 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆10Aug 7, 2023Updated 2 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning☆444Feb 21, 2019Updated 6 years ago
- This project was created for Unity ML-Agents Challenge - https://connect.unity.com/challenges/ml-agents-1☆12Aug 15, 2020Updated 5 years ago
- ☆10Feb 28, 2019Updated 6 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- ResearchDoom fork of the Chocolate Doom engine.☆16Oct 20, 2017Updated 8 years ago
- Value Iteration Networks☆291Apr 21, 2017Updated 8 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Jun 20, 2019Updated 6 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆66May 22, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- Char RNN Language Model based on Tensorflow☆12May 25, 2016Updated 9 years ago
- Link Prediction using Knowledge Embedding in Tensorflow☆13Nov 10, 2016Updated 9 years ago
- Reinforcement Learning Project☆12Jan 16, 2017Updated 9 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆11Apr 8, 2025Updated 10 months ago
- Gibson中文介绍☆11Apr 26, 2018Updated 7 years ago
- PSYCH 291: Causal Cognition (https://tobiasgerstenberg.github.io/causal_cognition/)☆12May 23, 2019Updated 6 years ago
- an implementation of reinforcement learning problem, stock prices☆10Dec 26, 2016Updated 9 years ago
- ☆80Oct 3, 2023Updated 2 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago
- Code for TPAMI 2020 paper - A Generalized Earley Parser for Human Activity Parsing and Prediction☆13Nov 23, 2020Updated 5 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Deep recommendation system☆13Dec 28, 2016Updated 9 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13May 7, 2015Updated 10 years ago
- ConvWave Searching for Gravitational Waves with Fully Convolutional Neural Nets☆14May 22, 2019Updated 6 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- ☆29May 17, 2017Updated 8 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Dec 31, 2016Updated 9 years ago
- ☆29May 27, 2024Updated last year
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆733Apr 12, 2023Updated 2 years ago
- ☆17Feb 14, 2018Updated 8 years ago
- Environments with IC3Net paper☆14Jan 8, 2019Updated 7 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 6 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- ☆11Jun 6, 2014Updated 11 years ago