akxlr / tbp
Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models
☆12Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for tbp
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆39Updated 6 years ago
- A branch-and-bound ILP solver☆26Updated 5 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27Updated 5 years ago
- Sum-Product Network learning routines in python☆26Updated 9 years ago
- Feasible target propagation code for the paper "Deep Learning as a Mixed Convex-Combinatorial Optimization Problem" by Friesen & Domingos…☆28Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Logistic Circuits☆35Updated 5 years ago
- ☆22Updated 3 years ago
- IPC: A Graph Data Set Compiled from International Planning Competitions☆44Updated 5 years ago
- Regularization, Neural Network Training Dynamics☆14Updated 4 years ago
- Matrix exponential in cuda for pytorch and tensorflow☆16Updated 5 years ago
- Implementation of the paper "Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning".☆25Updated 4 years ago
- PDP: A General Neural Framework for Learning Constraint Satisfaction Solvers☆40Updated last year
- Example implementation of the Bayesian neural network in "Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteri…☆31Updated 4 years ago
- Lagrangian VAE☆28Updated 6 years ago
- ☆26Updated 5 years ago
- Upper Confidence Tree Planner for ATARI games☆19Updated 8 years ago
- Reinforcement Learning with Convex Constraints☆14Updated 2 years ago
- RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Base…☆38Updated 6 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆44Updated last year
- Sum product algorithm - Belief propagation (message passing) for factor graphs☆87Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- Code for doubly stochastic gradients☆25Updated 10 years ago
- Code to minimize the Variational Contrastive Divergence (VCD)☆28Updated 5 years ago
- Code release for the ICLR paper☆20Updated 6 years ago
- code for "Quantile Stein Variational Gradient Descent"☆9Updated 5 years ago
- ☆26Updated 6 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Updated 5 years ago
- The Python PSDD Package☆16Updated 2 months ago
- Natural Gradient, Variational Inference☆29Updated 4 years ago