timvieira / rl
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆11 Updated 3 years ago
Related projects
Alternatives and complementary repositories for rl
- Code to reproduce the experiments in the paper "Lost Relatives of the Gumbel Trick" ☆16 Updated 7 years ago
- Variable-order CRFs with structure learning ☆16 Updated 3 months ago
- ☆18 Updated 2 years ago
- [DEPRECATED] Scoring models for the task of Automatic Knowledge Base Completion implemented with TensorFlow. ☆10 Updated 8 years ago
- Variational Walkback, NIPS'17 ☆28 Updated 7 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ) ☆49 Updated 6 years ago
- Analogs of Linguistic Structure in Deep Representations ☆19 Updated 7 years ago
- ☆30 Updated 2 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms ☆14 Updated 2 years ago
- Code for a generative controller for the AI Gym cartpole task ☆15 Updated 7 years ago
- Implementing FastSent in Theano ☆12 Updated 8 years ago
- ☆17 Updated 6 years ago
- Relevant code for the "Show Your Work" paper, EMNLP 2019. ☆18 Updated 5 years ago
- DyNet implementation of stack LSTM experiments by Grefenstette et al. ☆21 Updated 7 years ago
- LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs ☆41 Updated last year
- Supplementary code for "Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs". ☆47 Updated 5 years ago
- ☆10 Updated 9 years ago
- Python implementation of projection losses. ☆25 Updated 5 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop ☆15 Updated 6 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick. ☆32 Updated 8 years ago
- Code for the publication "Learning to Reason with Third-Order Tensor Products". ☆39 Updated 5 years ago
- ☆45 Updated 5 years ago
- ☆26 Updated 5 years ago
- Learning algorithms introduced in "A PAC-Bayes Sample Compression Approach to Kernel Methods" (ICML 2011) ☆9 Updated 10 years ago
- ☆50 Updated 6 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization. ☆14 Updated 7 years ago
- Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders ☆22 Updated 8 years ago
- TensorFlow implementation of Multi-Function Recurrent Unit ☆24 Updated 8 years ago
- Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing ☆22 Updated last month