pemami4911 / sinkhorn-policy-gradient.pytorchView external linksLinks
Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
☆40Aug 27, 2018Updated 7 years ago
Alternatives and similar repositories for sinkhorn-policy-gradient.pytorch
Users that are interested in sinkhorn-policy-gradient.pytorch are comparing it to the libraries listed below
Sorting:
- Implementation of Counterfactual risk minimization☆26Apr 13, 2017Updated 8 years ago
- The package is developed for treatment recommendation & pairwise treatment individual effect estimation (ITE/CATE/HTE) when multiple trea…☆11Mar 9, 2023Updated 2 years ago
- ☆10Oct 8, 2018Updated 7 years ago
- ☆11Jun 8, 2020Updated 5 years ago
- Learning generative models with Sinkhorn Loss☆30Nov 9, 2018Updated 7 years ago
- Graph Generation Grammar☆13Apr 10, 2023Updated 2 years ago
- Statistics on the space of asymmetric networks via Gromov-Wasserstein distance☆15Jun 13, 2020Updated 5 years ago
- This is the source code for solving the Traveling Salesman Problems (TSP) using Monte Carlo tree search (MCTS).☆34Sep 25, 2019Updated 6 years ago
- ☆15Apr 7, 2019Updated 6 years ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 6 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆18Nov 10, 2017Updated 8 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- ☆16Sep 4, 2018Updated 7 years ago
- TensorFlow examples☆23May 10, 2017Updated 8 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- ☆22Jul 25, 2023Updated 2 years ago
- Code used for the Arvix report: The Case for Automatic Database Administration using Deep Reinforcement Learning☆25May 13, 2020Updated 5 years ago
- Implementation of Neural Episodic Control in Tensorflow☆27May 16, 2019Updated 6 years ago
- ☆25Oct 31, 2020Updated 5 years ago
- ☆26Jun 17, 2022Updated 3 years ago
- Feasible target propagation code for the paper "Deep Learning as a Mixed Convex-Combinatorial Optimization Problem" by Friesen & Domingos…☆28Apr 12, 2018Updated 7 years ago
- Python term rewriting☆30Feb 14, 2013Updated 13 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Jun 28, 2019Updated 6 years ago
- Experiments of ACL 2018 paper box embeddings☆31Dec 5, 2018Updated 7 years ago
- Stochastic Unit Commitment for Renewable Energy Supply using Lagrangian Decomposition☆33Jun 3, 2018Updated 7 years ago
- Code for NIPS 2017 spotlight paper: "Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration" by Jason Alt…☆31Jan 4, 2018Updated 8 years ago
- ☆14Aug 20, 2025Updated 5 months ago
- Tensorflow implementation of preconditioned stochastic gradient descent☆34Nov 23, 2023Updated 2 years ago
- ☆77Sep 18, 2017Updated 8 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Codes for building an AI-native database☆76Jul 29, 2024Updated last year
- blah☆35May 5, 2019Updated 6 years ago
- This project focuses on using deep learning to replace text in images while retaining the same font and style.☆10Dec 9, 2019Updated 6 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- A Cython library to solve the Bittensor registration POW on CUDA☆15Aug 15, 2025Updated 6 months ago
- ☆13May 21, 2024Updated last year
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Redefining Video Management with power of SQL☆11Oct 15, 2023Updated 2 years ago
- ☆10Feb 17, 2019Updated 7 years ago