mechanism-learning-research / two-player-auctionsLinks

JAX/Haiku implementation of "Auction Learning as a Two-Player Game"

☆11

Alternatives and similar repositories for two-player-auctions

Users that are interested in two-player-auctions are comparing it to the libraries listed below

Sorting:

YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34Updated 5 years ago
uber-research / Evolvability-ES
☆14Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
guaguakai / surrogate-optimization-learning
☆17Updated 4 years ago
zixu1986 / Doubly_Stochastic_Gradients
Code for doubly stochastic gradients
☆25Updated 10 years ago
AnujMahajanOxf / VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14Updated 5 years ago
neeharperi / PreferenceNet
PreferenceNet: Encoding Human Preferences in Auction Design With Deep Learning
☆16Updated 4 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48Updated 6 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16Updated 6 years ago
johannesnauta / pytorch-pne
PyTorch implementation of Probabilistic Network Ensembles on toy problems
☆23Updated 2 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
daniellevy / fast-dro
PyTorch implementation of efficient algorithms for DRO with CVaR and Chi-Square uncertainty sets
☆61Updated 2 years ago
djstrouse / InfoMARL
using information theory to encourage agents to cooperate and compete
☆19Updated 6 years ago
tianjunz / MADE
☆19Updated 4 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23Updated 5 years ago
CausalML / DoubleReinforcementLearningMDP
☆12Updated 2 months ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16Updated last year
Networks-Learning / strategic-decisions
Code and data for decision making under strategic behavior, NeurIPS 2020 & Management Science 2024.
☆29Updated last year
facebookresearch / taskmet
TaskMet Task-driven Metric Learning for Model Learning
☆19Updated last year
TomZahavy / CB_AE_DQN
Contextual Bandits Action Elimination DQN
☆21Updated 7 years ago
google-research / deep_ope
☆86Updated last year
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆13Updated 3 years ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆22Updated 4 years ago
yudasong / briee
Representation Learning in RL
☆14Updated 3 years ago
mila-iqia / Conscious-Planning
Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".
☆59Updated 10 months ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆30Updated 8 years ago
nanavatirutu / CausalBandits
Project on Causal Machine learning CS 7290
☆16Updated 5 years ago
sanghack81 / SCMMAB-NIPS2018
Structural Causal Bandit
☆25Updated 3 years ago