timvieira / rl
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12Updated 4 years ago
Alternatives and similar repositories for rl:
Users that are interested in rl are comparing it to the libraries listed below
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Updated 7 years ago
- ☆18Updated 3 years ago
- ☆30Updated 3 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Updated 6 years ago
- Variational Walkback, NIPS'17☆28Updated 7 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick.☆32Updated 8 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Updated 3 years ago
- Analogs of Linguistic Structure in Deep Representations☆19Updated 7 years ago
- Code for a generative controller for the AI Gym cartpole task☆15Updated 8 years ago
- Code for the publication Learning to Reason with Third-Order Tensor Products.☆40Updated 6 years ago
- Python implementation of projection losses.☆26Updated 5 years ago
- Variable-order CRFs with structure learning☆16Updated 9 months ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Updated 6 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Updated 6 years ago
- Modelling epidemiological dynamics and performing inference in these models☆27Updated 3 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs☆41Updated last year
- ☆15Updated 6 years ago
- [DEPRECATED] Scoring models for the task of Automatic Knowledge Base Completion implemented with TensorFlow.☆9Updated 8 years ago
- NeurIPS 2018. Linear-time model comparison tests.☆18Updated 5 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Updated 5 years ago
- ☆42Updated 4 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27Updated 5 years ago
- Translating neuralese☆44Updated 8 years ago
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Updated 6 years ago
- ☆49Updated 7 years ago
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing☆22Updated 7 months ago
- ☆10Updated 9 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Updated 2 years ago