Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
☆19 · Jul 20, 2018 · Updated 7 years ago
Alternatives and similar repositories for LOLA-pytorch
Users who are interested in LOLA-pytorch are comparing it to the libraries listed below
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO) ☆23 · Dec 8, 2022 · Updated 3 years ago
- ☆27 · Oct 25, 2019 · Updated 6 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆151 · Apr 13, 2023 · Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆55 · Aug 30, 2024 · Updated last year
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020 ☆26 · Dec 8, 2022 · Updated 3 years ago
- This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe… ☆26 · Jan 6, 2019 · Updated 7 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 · Nov 29, 2022 · Updated 3 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning ☆27 · May 15, 2020 · Updated 5 years ago
- Setup generator for the board game Spirit Island 🏝️ ☆10 · Nov 24, 2023 · Updated 2 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning" ☆41 · Oct 5, 2022 · Updated 3 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling" ☆72 · Aug 18, 2016 · Updated 9 years ago
- ☆11 · May 13, 2021 · Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 · Nov 28, 2019 · Updated 6 years ago
- Lupa for Torch ☆10 · Sep 16, 2015 · Updated 10 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆37 · Dec 7, 2020 · Updated 5 years ago
- There will be updates later ☆88 · May 13, 2019 · Updated 6 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆96 · Aug 21, 2018 · Updated 7 years ago
- Implicit Distributional Actor Critic ☆11 · Dec 8, 2021 · Updated 4 years ago
- neuralpy - neural network library written in Python ☆12 · Jun 25, 2023 · Updated 2 years ago
- ☆11 · Dec 16, 2025 · Updated 2 months ago
- Tools for optimizing quantum circuits and for performing rescaling-based error mitigation. ☆10 · Jun 21, 2022 · Updated 3 years ago
- We have a dataset which contains various features of the car based on which we predict the Carbon dioxide emission. ☆15 · Oct 3, 2018 · Updated 7 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms ☆11 · Jan 14, 2025 · Updated last year
- Variable Neighborhood Search (VNS) algorithm for solving the TSP (with detailed C++ code and comments) ☆10 · May 12, 2019 · Updated 6 years ago
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City ☆12 · Dec 17, 2023 · Updated 2 years ago
- A tiny Python 2.7 script which converts LaTeX projects into arXiv format. Suggestions are welcome. ☆10 · Mar 20, 2016 · Updated 9 years ago
- Automatically exported from code.google.com/p/world-opponent-network ☆14 · Jul 28, 2015 · Updated 10 years ago
- Example project for developing a mod for Sentinels of the Multiverse ☆15 · Oct 4, 2024 · Updated last year
- Model-free policy gradient algorithm for LQR ☆10 · Apr 8, 2020 · Updated 5 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation" ☆10 · Mar 27, 2018 · Updated 7 years ago
- Implementation of the Off Belief Learning algorithm. ☆49 · Aug 18, 2022 · Updated 3 years ago
- ☆10 · Apr 24, 2021 · Updated 4 years ago
- HRHD-HK: A Benchmark Dataset of High-Rise and High-Density Urban Scenes for 3D Semantic Segmentation of Photogrammetric Point Clouds ☆10 · Dec 11, 2023 · Updated 2 years ago
- ☆12 · Feb 29, 2020 · Updated 6 years ago
- ☆12 · Jan 11, 2021 · Updated 5 years ago
- Rails application that allows humans to play poker matches managed by the Annual Computer Poker Competition's Dealer program in a web GUI… ☆10 · Apr 25, 2015 · Updated 10 years ago
- Reinforcement Learning from Hierarchical Critics ☆13 · Jul 30, 2020 · Updated 5 years ago
- Desktop weather forecast app (based on Qt5, with a clear code structure and detailed comments) ☆11 · Jul 29, 2023 · Updated 2 years ago
- 😈 Train ViZDoom agents by Reinforcement Learning 👻 ☆12 · Dec 5, 2017 · Updated 8 years ago