timvieira / rlLinks
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12Updated 4 years ago
Alternatives and similar repositories for rl
Users that are interested in rl are comparing it to the libraries listed below
Sorting:
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Updated 7 years ago
- ☆18Updated 3 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Updated 3 years ago
- ☆30Updated 3 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Updated 6 years ago
- Variational Walkback, NIPS'17☆28Updated 7 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- Python implementation of projection losses.☆26Updated 5 years ago
- Variable-order CRFs with structure learning☆16Updated 10 months ago
- Analogs of Linguistic Structure in Deep Representations☆19Updated 7 years ago
- ☆15Updated 6 years ago
- Template-DQN and DRRN agent implementations☆22Updated last year
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Updated 6 years ago
- Code for the publication Learning to Reason with Third-Order Tensor Products.☆40Updated 6 years ago
- ☆17Updated 7 years ago
- DyNet implementation of stack LSTM experiments by Grefenstette et al.☆21Updated 7 years ago
- NeurIPS 2018. Linear-time model comparison tests.☆18Updated 5 years ago
- A reference implementation of algorithms for distributions over spanning trees.☆21Updated 5 years ago
- Matlab code implementing Minimum Probability Flow Learning.☆69Updated 10 years ago
- Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing☆22Updated 8 months ago
- Tensor product decomposition network☆19Updated 4 years ago
- Modelling epidemiological dynamics and performing inference in these models☆27Updated 3 years ago
- ☆49Updated 7 years ago
- ☆42Updated 4 years ago
- A list of resources dedicated to compositionality☆14Updated 6 years ago
- ☆26Updated 6 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Updated 6 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick.☆32Updated 9 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Updated 6 years ago
- ProBO: Versatile Bayesian Optimization Using Any Probabilistic Programming Language☆15Updated 5 years ago