Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
☆19Jul 20, 2018Updated 7 years ago
Alternatives and similar repositories for LOLA-pytorch
Users that are interested in LOLA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Dec 8, 2022Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆150Apr 13, 2023Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆57Aug 30, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆27Oct 25, 2019Updated 6 years ago
- Lupa for Torch☆10Sep 16, 2015Updated 10 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206☆11Aug 30, 2024Updated last year
- This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe…☆26Jan 6, 2019Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 7 years ago
- High granularity and accuracy Starcraft replay data extractor which outputs to a database☆14Feb 18, 2022Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Differential game theory for multi-agent collision avoidance. Simulations set up.☆12Jan 27, 2021Updated 5 years ago
- ☆11Dec 16, 2025Updated 6 months ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 3 years ago
- Python implement of paper "PD-FAC: Probability Density Factorized Multi-Agent Distributional Reinforcement Learning for Multi-Robot Relia…☆12Mar 5, 2022Updated 4 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆43Oct 5, 2022Updated 3 years ago
- Multiagent gridworld for the TEAM project based on gym-minigrid☆12Nov 27, 2019Updated 6 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Apr 15, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- There will be updates later☆88May 13, 2019Updated 7 years ago
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City☆12Dec 17, 2023Updated 2 years ago
- This project offers to solve Multi-Agent-Path-Finding(MAPF) problem optimally using Conflict-Based Search(CBS).☆14Aug 31, 2022Updated 3 years ago
- Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch☆10Aug 2, 2020Updated 5 years ago
- 桌面天气预报(基于Qt5,代码结构清晰并含有详细注释)☆11Jul 29, 2023Updated 2 years ago
- Flight Gear Multiplayer Server [MIRROR]☆18Mar 3, 2025Updated last year
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27May 15, 2020Updated 6 years ago
- Ledger Nano Kusama / Polkadot integration library + examples☆16Jun 8, 2026Updated last week
- Optimal Control and Trajectory Tracking for the Ballbot (Nagarajan et al. 2014)☆18Dec 14, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- A framework that calibrates object properties through differentiable simulations of robot-object interactions.☆25May 3, 2025Updated last year
- cgo wrappers around post-quantum cryptography primitives☆23Dec 3, 2018Updated 7 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Rails application that allows humans to play poker matches managed by the Annual Computer Poker Competition's Dealer program in a web GUI…☆11Apr 25, 2015Updated 11 years ago
- ☆48Dec 8, 2022Updated 3 years ago