Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
☆19Jul 20, 2018Updated 7 years ago
Alternatives and similar repositories for LOLA-pytorch
Users that are interested in LOLA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Dec 8, 2022Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆152Apr 13, 2023Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆55Aug 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27Oct 25, 2019Updated 6 years ago
- Lupa for Torch☆10Sep 16, 2015Updated 10 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206☆11Aug 30, 2024Updated last year
- This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe…☆26Jan 6, 2019Updated 7 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Learning Long-Horizon Robot Exploration Strategies for Multi-Object Search in Continuous Action Spaces. http://multi-object-search.cs.uni…☆13Nov 29, 2022Updated 3 years ago
- High granularity and accuracy Starcraft replay data extractor which outputs to a database☆14Feb 18, 2022Updated 4 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Differential game theory for multi-agent collision avoidance. Simulations set up.☆12Jan 27, 2021Updated 5 years ago
- ☆10May 22, 2023Updated 2 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 6 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- Python implement of paper "PD-FAC: Probability Density Factorized Multi-Agent Distributional Reinforcement Learning for Multi-Robot Relia…☆11Mar 5, 2022Updated 4 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆42Oct 5, 2022Updated 3 years ago
- Multiagent gridworld for the TEAM project based on gym-minigrid☆12Nov 27, 2019Updated 6 years ago
- There will be updates later☆88May 13, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14May 17, 2022Updated 3 years ago
- Presentation software based on nested, zoomable canvases and live code.☆20Aug 25, 2017Updated 8 years ago
- 变邻域搜索算法(VNS)求解TSP(附C++详细代码及注释)☆10May 12, 2019Updated 6 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- This project offers to solve Multi-Agent-Path-Finding(MAPF) problem optimally using Conflict-Based Search(CBS).☆13Aug 31, 2022Updated 3 years ago
- source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…☆11Jan 26, 2021Updated 5 years ago
- Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch☆10Aug 2, 2020Updated 5 years ago
- 桌面天气预报(基于Qt5,代码结构清晰并含 有详细注释)☆11Jul 29, 2023Updated 2 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27May 15, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ledger Nano Kusama / Polkadot integration library + examples☆16Updated this week
- Optimal Control and Trajectory Tracking for the Ballbot (Nagarajan et al. 2014)☆17Dec 14, 2018Updated 7 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- A framework that calibrates object properties through differentiable simulations of robot-object interactions.☆24May 3, 2025Updated 10 months ago
- Repo for a generalised DQN Agent model capable of solving major discrete action space control problems☆18Aug 20, 2018Updated 7 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago