Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
☆40Aug 27, 2018Updated 7 years ago
Alternatives and similar repositories for sinkhorn-policy-gradient.pytorch
Users that are interested in sinkhorn-policy-gradient.pytorch are comparing it to the libraries listed below
Sorting:
- A Random Matrix Approach to Extreme Learning Machine☆15Feb 23, 2018Updated 8 years ago
- ☆11Jun 8, 2020Updated 5 years ago
- Learning generative models with Sinkhorn Loss☆30Nov 9, 2018Updated 7 years ago
- ☆15Apr 7, 2019Updated 6 years ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 6 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- Code for the 2-simplicial Transformer paper☆21Jan 16, 2020Updated 6 years ago
- ☆16Sep 4, 2018Updated 7 years ago
- TensorFlow examples☆23May 10, 2017Updated 8 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆29Aug 9, 2018Updated 7 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- Code used for the Arvix report: The Case for Automatic Database Administration using Deep Reinforcement Learning☆25May 13, 2020Updated 5 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Jun 28, 2019Updated 6 years ago
- Python implementation of projection losses.☆27Nov 18, 2019Updated 6 years ago
- Stochastic Unit Commitment for Renewable Energy Supply using Lagrangian Decomposition☆33Jun 3, 2018Updated 7 years ago
- ☆14Aug 20, 2025Updated 6 months ago
- ☆77Sep 18, 2017Updated 8 years ago
- Solves a Mixed Integer Linear Program to generate the Stacklberg Equilibrium of a General-sum (+Bayesian) Games.☆36Jan 9, 2026Updated 2 months ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- Codes for building an AI-native database☆76Jul 29, 2024Updated last year
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 5 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- ☆13May 21, 2024Updated last year
- ☆10Feb 17, 2019Updated 7 years ago
- Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design☆23Jan 9, 2026Updated 2 months ago
- Redefining Video Management with power of SQL☆11Oct 15, 2023Updated 2 years ago
- ☆12Jul 7, 2022Updated 3 years ago
- An (abridged) time series of Aave wallet health factors (and associated token counts, prices, liquidation thresholds)☆11Jul 14, 2022Updated 3 years ago
- Machine Learning for Mathematical Formalization☆11Jul 20, 2024Updated last year
- Polynomial semantics of linear logic☆13Apr 15, 2018Updated 7 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- Dynamic config system based on python classes☆12Jan 27, 2023Updated 3 years ago
- This project focuses on using deep learning to replace text in images while retaining the same font and style.☆10Dec 9, 2019Updated 6 years ago
- [ICLR 2019] Learning Representations of Sets through Optimized Permutations☆36May 1, 2019Updated 6 years ago
- Optimizing control variates for black-box gradient estimation☆163Jul 26, 2019Updated 6 years ago
- Codes for NIPS 2019 Paper: Rethinking Kernel Methods for Node Representation Learning on Graphs☆34Feb 20, 2020Updated 6 years ago
- Code for the paper "Batch size invariance for policy optimization"☆57Apr 2, 2023Updated 2 years ago
- RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Base…☆39Feb 21, 2018Updated 8 years ago