agakshat/LOLA-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/agakshat/LOLA-pytorch)

agakshat / LOLA-pytorch

Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)

☆19

Alternatives and similar repositories for LOLA-pytorch

Users that are interested in LOLA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chaovven / SMIX
View on GitHub
Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020
☆26Dec 8, 2022Updated 3 years ago
alshedivat / lola
View on GitHub
Code release for Learning with Opponent-Learning Awareness and variations.
☆152Apr 13, 2023Updated 3 years ago
atavakol / action-hypergraph-networks
View on GitHub
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 5 years ago
agakshat / spacefortress
View on GitHub
OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206
☆11Aug 30, 2024Updated last year
clvoloshin / constrained_batch_policy_learning
View on GitHub
☆27Oct 25, 2019Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jleni / lupa
View on GitHub
Lupa for Torch
☆10Sep 16, 2015Updated 10 years ago
sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
alexis-jacq / LOLA_DiCE
View on GitHub
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆98Aug 21, 2018Updated 7 years ago
accelerate0818 / VariableNeighborhoodSearchTSP
View on GitHub
变邻域搜索算法(VNS)求解TSP（附C++详细代码及注释）
☆10May 12, 2019Updated 7 years ago
apoddar573 / Tic-Tac-Toe-Gym_Environment
View on GitHub
This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe…
☆26Jan 6, 2019Updated 7 years ago
ruizhaogit / mep
View on GitHub
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24May 30, 2019Updated 7 years ago
phoglenix / ScExtractor
View on GitHub
High granularity and accuracy Starcraft replay data extractor which outputs to a database
☆14Feb 18, 2022Updated 4 years ago
luizgiacomossi / Search_Drone_Swarms
View on GitHub
☆11Dec 16, 2025Updated 7 months ago
jachiam / surprise
View on GitHub
Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jleguina / KheperaIV_ROS
View on GitHub
Differential game theory for multi-agent collision avoidance. Simulations set up.
☆12Jan 27, 2021Updated 5 years ago
caocscar / ConnectedVehicleDocs
View on GitHub
Documentation for UMTRI's Connected Vehicle Dataset
☆10Oct 15, 2020Updated 5 years ago
antonio-f / Dynamic-Programming
View on GitHub
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…
☆11Apr 3, 2019Updated 7 years ago
deligentfool / SIDE
View on GitHub
Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"
☆11Jun 24, 2022Updated 4 years ago
hhexiy / opponent
View on GitHub
Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"
☆71Apr 15, 2026Updated 3 months ago
GIS-PuppetMaster / Auto-STGCN
View on GitHub
source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…
☆11Jan 26, 2021Updated 5 years ago
YoZo-X / PD-FAC
View on GitHub
Python implement of paper "PD-FAC: Probability Density Factorized Multi-Agent Distributional Reinforcement Learning for Multi-Robot Relia…
☆12Mar 5, 2022Updated 4 years ago
uoe-agents / LIAM
View on GitHub
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆43Oct 5, 2022Updated 3 years ago
mila-iqia / teamgrid
View on GitHub
Multiagent gridworld for the TEAM project based on gym-minigrid
☆12Nov 27, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cbschaff / evolving-soft-robots
View on GitHub
☆14May 17, 2022Updated 4 years ago
Sonkyunghwan / QTRAN
View on GitHub
There will be updates later
☆87May 13, 2019Updated 7 years ago
renweiya / RFQ-RFAC
View on GitHub
Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning
☆17Mar 11, 2020Updated 6 years ago
Sheepsody / Batched-Impala-PyTorch
View on GitHub
Reinforcement learning - Batched Impala - PyTorch - Mario Kart
☆13Jul 21, 2020Updated 6 years ago
tsinghua-fib-lab / UGI
View on GitHub
Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City
☆12Dec 17, 2023Updated 2 years ago
flowersteam / curious
View on GitHub
Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
☆27May 15, 2020Updated 6 years ago
hui0927 / QtWeather
View on GitHub
桌面天气预报(基于Qt5,代码结构清晰并含有详细注释)
☆11Jul 29, 2023Updated 3 years ago
Zondax / ledger-substrate-js
View on GitHub
Ledger Nano Kusama / Polkadot integration library + examples
☆17Jun 22, 2026Updated last month
yswhynot / codesign-soft-gripper
View on GitHub
☆19Sep 9, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
noambrown / acpc_poker_gui_client
View on GitHub
Rails application that allows humans to play poker matches managed by the Annual Computer Poker Competition's Dealer program in a web GUI…
☆11Apr 25, 2015Updated 11 years ago
deligentfool / maddpg
View on GitHub
Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch
☆10Aug 2, 2020Updated 5 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
ml3705454 / mapr2
View on GitHub
☆48Dec 8, 2022Updated 3 years ago
berkeleyflow / flow
View on GitHub
☆14Aug 26, 2018Updated 7 years ago
lns / dapo
View on GitHub
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Nov 28, 2019Updated 6 years ago
bastien-muraccioli / svlr
View on GitHub
SVLR: Scalable, Training-Free Visual Language Robotics: a modular multi-model framework for consumer-grade GPUs
☆15Jan 22, 2026Updated 6 months ago