heidekrueger / bnelearnLinks

A Framework for Equilibrium Learning in Sealed-Bid Auctions

☆24

Alternatives and similar repositories for bnelearn

Users that are interested in bnelearn are comparing it to the libraries listed below

Sorting:

ssokota / mmd
Code for magnetic mirror descent.
☆16Updated last year
hsvgbkhgbv / shapley-q-learning
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
☆47Updated last year
causal-rl-anonymous / causal-rl
☆44Updated 3 years ago
BorealisAI / mtmfrl
Multi Type Mean Field Reinforcement Learning
☆31Updated 3 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆94Updated 3 weeks ago
tesatory / hsp
Hierarchical Self-Play
☆21Updated 6 years ago
facebookresearch / off-belief-learning
Implementation of the Off Belief Learning algorithm.
☆47Updated 2 years ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Updated 2 years ago
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆14Updated 3 years ago
MadryLab / implementation-matters
☆132Updated 11 months ago
dtak / mbrl-smdp-ode
PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020
☆42Updated 4 years ago
waterhorse1 / NAC
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28Updated 3 years ago
ryan-dorazio / mmd-dilated
An implementation of the QRE solver magnetic mirror descent with dilated entropy (MMD).
☆8Updated 2 years ago
Miffyli / rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
☆30Updated 5 years ago
google-research / deep_ope
☆86Updated 11 months ago
wyjung0625 / p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Updated 5 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52Updated 2 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆95Updated 3 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆52Updated 6 years ago
xkianteb / ApproPO
Reinforcement Learning with Convex Constraints
☆14Updated 3 years ago
dido1998 / CausalMBRL
Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
☆48Updated 4 years ago
ElisevanderPol / symmetrizer
☆31Updated 4 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆12Updated 3 years ago
JBLanier / stratego_env
Multi-Agent RL Environment for the Stratego Board Game (and variants)
☆34Updated last year
lanyavik / BAIL
☆17Updated 3 years ago
social-dilemma / multiagent
Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas
☆49Updated 2 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
huanzhang12 / ATLA_robust_RL
Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework
☆67Updated 4 years ago
martyput / MDP_book
☆122Updated 2 months ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago