agakshat / LOLA-pytorchView external linksLinks
Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
☆19Jul 20, 2018Updated 7 years ago
Alternatives and similar repositories for LOLA-pytorch
Users that are interested in LOLA-pytorch are comparing it to the libraries listed below
Sorting:
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe…☆26Jan 6, 2019Updated 7 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27May 15, 2020Updated 5 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆39Mar 28, 2018Updated 7 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- ☆11May 13, 2021Updated 4 years ago
- ☆12Apr 4, 2023Updated 2 years ago
- The official implementation of AAAI 2024 paper: Estimating On-road Transportation Carbon Emissions from Open Data of Road Network and Ori…☆11Feb 24, 2024Updated last year
- Reinforcement Learning for Fault-Tolerant Quantum Circuit Discovery☆16Jan 16, 2026Updated 3 weeks ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Dec 7, 2020Updated 5 years ago
- Example project for developing a mod for Sentinels of the Multiverse☆15Oct 4, 2024Updated last year
- A tiny python2.7 script which converts LaTex projects into arxiv-format. Suggestions are welcome.☆10Mar 20, 2016Updated 9 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 7 years ago
- Tools for optimizing quantum circuits and for performing rescaling-based error mitigation.☆10Jun 21, 2022Updated 3 years ago
- A simple ptuc to C compiler using flex and bison.☆10May 1, 2018Updated 7 years ago
- neuralpy - neural network library written in python☆12Jun 25, 2023Updated 2 years ago
- We have a dataset which contains various features of the car based on which we predict the Carbon dioxide emission.☆15Oct 3, 2018Updated 7 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Automatically exported from code.google.com/p/world-opponent-network☆14Jul 28, 2015Updated 10 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City☆11Dec 17, 2023Updated 2 years ago
- ☆10Mar 10, 2021Updated 4 years ago
- ☆11Dec 16, 2025Updated last month
- ☆10May 22, 2023Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- ☆12Feb 29, 2020Updated 5 years ago
- hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learningThermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning☆10Dec 10, 2018Updated 7 years ago
- ☆10Apr 24, 2021Updated 4 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- ☆10Apr 13, 2023Updated 2 years ago
- Code from my Medium article about React and Socket.io☆10May 8, 2019Updated 6 years ago
- 桌面天气预报(基于Qt5,代码结构清晰并含有详细注释)☆11Jul 29, 2023Updated 2 years ago
- ☆12Jan 11, 2021Updated 5 years ago