Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12 · Jan 28, 2021 · Updated 5 years ago
Alternatives and similar repositories for rl
Users that are interested in rl are comparing it to the libraries listed below.
- Variable-order CRFs with structure learning ☆17 · Aug 1, 2024 · Updated last year
- Repo for the EACL2017 tutorial on imitation learning ☆28 · Apr 3, 2017 · Updated 8 years ago
- MiTextExplorer - interactive browser of text and document covariates. ☆24 · Jun 17, 2015 · Updated 10 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick ☆17 · Jun 14, 2017 · Updated 8 years ago
- ☆12 · Jan 9, 2019 · Updated 7 years ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal. ☆13 · Aug 28, 2023 · Updated 2 years ago
- A Declarative Theorem Prover for First-Order Classical Logic ☆29 · Jun 14, 2024 · Updated last year
- ☆12 · Sep 30, 2018 · Updated 7 years ago
- Interactive tutorial on the Forward-Backward Expectation Maximization algorithm ☆30 · Dec 15, 2015 · Updated 10 years ago
- NeurIPS 2018. Linear-time model comparison tests. ☆18 · Feb 15, 2020 · Updated 6 years ago
- A simulation environment for the creation and observation of ML models based on PyTorch ☆12 · Mar 22, 2019 · Updated 7 years ago
- ☆25 · May 20, 2020 · Updated 5 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning> ☆11 · Oct 8, 2021 · Updated 4 years ago
- EWoK dataset generation framework ☆10 · May 14, 2024 · Updated last year
- RL framework for embodied agents based on PyTorch ☆11 · Apr 11, 2019 · Updated 6 years ago
- Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing ☆22 · Sep 24, 2024 · Updated last year
- My PhD thesis, titled "Reasonably Programmable Syntax" ☆15 · Aug 28, 2018 · Updated 7 years ago
- An Adaptor Grammar model implementation in Python. ☆17 · Jan 31, 2020 · Updated 6 years ago
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021. ☆10 · Aug 23, 2021 · Updated 4 years ago
- ☆12 · Jul 25, 2018 · Updated 7 years ago
- MLE-Guided Parameter Search (AAAI 2021) ☆11 · Sep 16, 2021 · Updated 4 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control" ☆12 · Apr 23, 2021 · Updated 4 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. … ☆35 · Aug 25, 2016 · Updated 9 years ago
- Code and data for Veridicality classifier on Twitter ☆11 · May 23, 2018 · Updated 7 years ago
- Replication Materials for "Crowd-Sourced Text Analysis" APSR (2016) 110(2): 278-295. ☆11 · Oct 28, 2017 · Updated 8 years ago
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms. ☆20 · Jan 30, 2026 · Updated last month
- learning to play atari games with reinforcement learning ☆10 · Jan 4, 2016 · Updated 10 years ago
- Tweets annotated with coarse-grained sense labels (supersenses) ☆13 · Jun 13, 2014 · Updated 11 years ago
- simple version of coptermanager ☆13 · Sep 20, 2014 · Updated 11 years ago
- JavaScript Hiccup compiler ☆51 · May 27, 2022 · Updated 3 years ago
- Dyna built on R-exprs (First Prototype) ☆17 · Mar 7, 2022 · Updated 4 years ago
- Nearest Neighbor Search in High Dimensional Spaces ☆13 · Nov 18, 2015 · Updated 10 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it. ☆12 · Jun 20, 2017 · Updated 8 years ago
- Models, scripts, and data sets for data annotation (aka coding, aka rating) ☆12 · Mar 9, 2015 · Updated 11 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 · Nov 2, 2017 · Updated 8 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)" ☆11 · Oct 2, 2018 · Updated 7 years ago
- Model-Free-Episodic-Control implementation. ☆17 · Jun 3, 2019 · Updated 6 years ago
- Source code for ScaleGrad ☆19 · Dec 28, 2021 · Updated 4 years ago
- Code for MSID, a Multi-Scale Intrinsic Distance for comparing generative models, studying neural networks, and more! ☆52 · May 29, 2019 · Updated 6 years ago