heidekrueger / bnelearn
A Framework for Equilibrium Learning in Sealed-Bid Auctions
☆24Updated 2 years ago
Alternatives and similar repositories for bnelearn
Users that are interested in bnelearn are comparing it to the libraries listed below
Sorting:
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- ☆43Updated 3 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Updated 5 years ago
- ☆17Updated 2 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆29Updated 2 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- Reinforcement Learning with Convex Constraints☆14Updated 3 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆14Updated 3 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆30Updated 5 years ago
- Code for magnetic mirror descent.☆16Updated last year
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆25Updated 3 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago
- ☆12Updated last week
- ☆31Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆47Updated 2 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 3 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆33Updated 2 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆25Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆10Updated 10 months ago
- ☆32Updated 9 months ago
- ☆85Updated 9 months ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 4 years ago
- ☆117Updated last week
- Efficient Exploration through Bayesian Deep-Q Networks.☆17Updated 3 years ago
- An implementation of the QRE solver magnetic mirror descent with dilated entropy (MMD).☆8Updated 2 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆17Updated 3 years ago
- Revisiting Rainbow☆74Updated 3 years ago