timvieira / rlLinks

Reference implementation of algorithms for reinforcement learning and Markov decision processes.

☆12

Alternatives and similar repositories for rl

Users that are interested in rl are comparing it to the libraries listed below

Sorting:

matejbalog / gumbel-relatives
Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick
☆17Updated 8 years ago
harvardnlp / strux
☆18Updated 3 years ago
hal3 / macarico
learning to search in pytorch
☆110Updated 5 years ago
rizar / systematic-generalization-sqoop
Code for "Systematic Generalization: What Is Required and Can It Be Learned"
☆37Updated 6 years ago
UCLA-StarAI / LearnPSDD
☆15Updated 6 years ago
stanis-morozov / prodige
A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.
☆47Updated 5 years ago
cmaddis / astar-sampling
A generic Monte Carlo method based on the Gumbel-Max trick.
☆32Updated 9 years ago
viking-sudo-rm / stacknn-core
Pip-installable differentiable stacks in PyTorch!
☆65Updated 4 years ago
jacobandreas / neuralese
Translating neuralese
☆44Updated 8 years ago
google-research / autoconj
Recognizing and exploiting conjugacy without a domain-specific language
☆36Updated 5 years ago
uclnlp / adversarial-nli
Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."
☆25Updated 6 years ago
anirudh9119 / walkback_nips17
Variational Walkback, NIPS'17
☆28Updated 7 years ago
deep-spin / lp-sparsemap
LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
☆41Updated last year
BorealisAI / lite_tracer
a light weight experiment reproducibility toolset
☆40Updated 4 years ago
wittawatj / kernel-mod
NeurIPS 2018. Linear-time model comparison tests.
☆18Updated 5 years ago
RemiLeblond / SeaRNN-open
Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)
☆48Updated 7 years ago
srush / jax-lda
☆30Updated 3 years ago
harvardnlp / hmm-lm
☆41Updated 4 years ago
mblondel / projection-losses
Python implementation of projection losses.
☆27Updated 5 years ago
Sohl-Dickstein / Minimum-Probability-Flow-Learning
Matlab code implementing Minimum Probability Flow Learning.
☆69Updated 10 years ago
elanmart / psmm
☆49Updated 7 years ago
ischlag / TPR-RNN
Code for the publication Learning to Reason with Third-Order Tensor Products.
☆41Updated 6 years ago
ForoughA / neuralMath
Combining Symbolic and Function Evaluation Expressions In Neural Programs
☆34Updated 5 years ago
ermongroup / BiasAndGeneralization
☆26Updated 6 years ago
timvieira / learning-to-prune
Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing
☆22Updated 10 months ago
plai-group / covid
Modelling epidemiological dynamics and performing inference in these models
☆27Updated 4 years ago
timvieira / spanning_tree
A reference implementation of algorithms for distributions over spanning trees.
☆21Updated 5 years ago
timvieira / vocrf
Variable-order CRFs with structure learning
☆16Updated last year
vene / sparsemap
SparseMAP: differentiable sparse structure inference
☆112Updated 6 years ago
kyunghyuncho / backprop-kalman-filter
☆45Updated 5 years ago