geek-ai/1m-agents

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/geek-ai/1m-agents)

geek-ai / 1m-agents

A platform of grid world that supports up to 1 million reinforcement-learning agents.

☆70

Alternatives and similar repositories for 1m-agents

Users that are interested in 1m-agents are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

geek-ai / MAgent
View on GitHub
A Platform for Many-Agent Reinforcement Learning
☆1,761Oct 22, 2022Updated 3 years ago
ehknight / natural-gradient-deep-q-learning
View on GitHub
☆23Oct 7, 2018Updated 7 years ago
openai / robosumo
View on GitHub
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
☆309Apr 13, 2023Updated 3 years ago
Kaixhin / malmo-challenge
View on GitHub
Malmo Collaborative AI Challenge - Team Pig Catcher
☆64May 22, 2017Updated 9 years ago
muupan / predictron
View on GitHub
WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer
☆11Dec 31, 2016Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
facebookresearch / M3RL
View on GitHub
Mind-aware Multi-agent Management Reinforcement Learning
☆80Mar 6, 2019Updated 7 years ago
yuandong-tian / ICML17_ReLU
View on GitHub
☆29May 17, 2017Updated 9 years ago
iassael / learning-to-communicate
View on GitHub
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
☆449Feb 21, 2019Updated 7 years ago
avivt / VIN
View on GitHub
Value Iteration Networks
☆291Apr 21, 2017Updated 9 years ago
steveKapturowski / tensorflow-rl
View on GitHub
Implementations of deep RL papers and random experimentation
☆178Apr 7, 2018Updated 8 years ago
Nat-D / FeatureControlHRL
View on GitHub
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆81Nov 22, 2017Updated 8 years ago
mfranzs / meta-learning-curiosity-algorithms
View on GitHub
☆80Oct 3, 2023Updated 2 years ago
Kaixhin / NoisyNet-A3C
View on GitHub
Noisy Networks for Exploration
☆187Jan 28, 2018Updated 8 years ago
apsdehal / ic3net-envs
View on GitHub
Environments with IC3Net paper
☆15Jan 8, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kazizzad / DCGAN-Gluon-MxNet
View on GitHub
☆12Sep 30, 2017Updated 8 years ago
zhongwen / predictron
View on GitHub
Tensorflow implementation of "The Predictron: End-To-End Learning and Planning"
☆289Jan 20, 2017Updated 9 years ago
LucasAlegre / mbcd
View on GitHub
Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"
☆11Aug 7, 2023Updated 2 years ago
vlfeat / researchdoom-chocolate
View on GitHub
ResearchDoom fork of the Chocolate Doom engine.
☆16Oct 20, 2017Updated 8 years ago
openai / coinrun
View on GitHub
Code for the paper "Quantifying Transfer in Reinforcement Learning"
☆405Oct 7, 2023Updated 2 years ago
valeriechen / ask-your-humans
View on GitHub
Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"
☆11Apr 8, 2025Updated last year
miyosuda / episodic_control
View on GitHub
Model-Free Episodic Control
☆14Jan 12, 2017Updated 9 years ago
williamFalcon / DeepRLHacks
View on GitHub
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
☆1,126Oct 13, 2017Updated 8 years ago
jacobBaumbach / MCWMD
View on GitHub
Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow
☆12Sep 12, 2016Updated 9 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
facebookarchive / CommNet
View on GitHub
Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736
☆220Jan 3, 2017Updated 9 years ago
aa14k / Exploration-in-RL
View on GitHub
☆29May 27, 2024Updated 2 years ago
Philip-Bachman / MatNets-NIPS
View on GitHub
Repo for code for the NIPS paper entitled "An Architecture for Deep, Hierarchical Generative Models"
☆14Oct 27, 2016Updated 9 years ago
lns / dapo
View on GitHub
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Nov 28, 2019Updated 6 years ago
ChunyuanLI / RAS
View on GitHub
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15Jan 21, 2019Updated 7 years ago
sisl / MADRL
View on GitHub
Repo containing code for multi-agent deep reinforcement learning (MADRL).
☆750Jul 7, 2026Updated 2 weeks ago
mlii / mfrl
View on GitHub
Mean Field Multi-Agent Reinforcement Learning
☆422Mar 11, 2020Updated 6 years ago
haarnoja / softqlearning
View on GitHub
Reinforcement Learning with Deep Energy-Based Policies
☆438Nov 28, 2023Updated 2 years ago
Breakend / EthicsInDialogue
View on GitHub
☆18Feb 14, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HumanCompatibleAI / population-irl
View on GitHub
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆27Jun 20, 2019Updated 7 years ago
Buzz-Beater / GEP_PAMI
View on GitHub
Code for TPAMI 2020 paper - A Generalized Earley Parser for Human Activity Parsing and Prediction
☆13Nov 23, 2020Updated 5 years ago
aicenter / TensorCFR
View on GitHub
☆10Feb 28, 2019Updated 7 years ago
psygement / DRS
View on GitHub
Deep recommendation system
☆13Dec 28, 2016Updated 9 years ago
ExpectationBackpropagation / EBP_Matlab_Code
View on GitHub
☆11Sep 15, 2015Updated 10 years ago
davidhershey / feudal_networks
View on GitHub
An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161
☆186Nov 1, 2017Updated 8 years ago
ElofssonLab / PconsC2
View on GitHub
Improved contact predictions using the recognition of protein like contact patterns.
☆14May 18, 2018Updated 8 years ago