mechanism-learning-research / two-player-auctionsLinks
JAX/Haiku implementation of "Auction Learning as a Two-Player Game"
☆11Updated last year
Alternatives and similar repositories for two-player-auctions
Users that are interested in two-player-auctions are comparing it to the libraries listed below
Sorting:
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 5 years ago
- ☆14Updated 6 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- ☆17Updated 4 years ago
- Code for doubly stochastic gradients☆25Updated 10 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 5 years ago
- PreferenceNet: Encoding Human Preferences in Auction Design With Deep Learning☆16Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems☆23Updated 2 years ago
- Generalised UDRL☆37Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- PyTorch implementation of efficient algorithms for DRO with CVaR and Chi-Square uncertainty sets☆61Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- ☆19Updated 4 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- ☆12Updated 2 months ago
- Variational Reinforcement Learning☆16Updated last year
- Code and data for decision making under strategic behavior, NeurIPS 2020 & Management Science 2024.☆29Updated last year
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- Contextual Bandits Action Elimination DQN☆21Updated 7 years ago
- ☆86Updated last year
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- Representation Learning in RL☆14Updated 3 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated 10 months ago
- Robust policy search algorithms which train on model ensembles☆30Updated 8 years ago
- Project on Causal Machine learning CS 7290☆16Updated 5 years ago
- Structural Causal Bandit☆25Updated 3 years ago