mechanism-learning-research / two-player-auctionsLinks
JAX/Haiku implementation of "Auction Learning as a Two-Player Game"
☆11Updated last year
Alternatives and similar repositories for two-player-auctions
Users that are interested in two-player-auctions are comparing it to the libraries listed below
Sorting:
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 6 years ago
- ☆14Updated 6 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Variational Reinforcement Learning☆17Updated last year
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 6 years ago
- ☆19Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Updated 3 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated last year
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 3 years ago
- PyTorch implementation of efficient algorithms for DRO with CVaR and Chi-Square uncertainty sets☆63Updated 3 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Updated 7 years ago
- Ranking Policy Gradient☆23Updated 6 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Updated 3 years ago
- Representation Learning in RL☆13Updated 3 years ago
- ☆18Updated 4 years ago
- Code to reproduce results on toy tasks and companion blog for the paper.☆21Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 3 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 5 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 7 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems☆23Updated 2 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆37Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 6 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 4 years ago
- ☆89Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 7 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Updated 3 years ago