zt95 / infinite-horizon-off-policy-estimation
☆13 Apr 3, 2019 Updated 6 years ago
Alternatives and similar repositories for infinite-horizon-off-policy-estimation
Users who are interested in infinite-horizon-off-policy-estimation are comparing it to the libraries listed below.
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL ☆24 Apr 14, 2022 Updated 3 years ago
- The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro… ☆31 Nov 20, 2020 Updated 5 years ago
- An easy-to-use jekyll theme for creating a workshop webpage (useful for AI / ML / CV / robotics folks) ☆28 Jan 3, 2021 Updated 5 years ago
- ☆17 Oct 30, 2025 Updated 3 months ago
- Datacenter simulation toolkit for the OpenDC project ☆10 Aug 24, 2020 Updated 5 years ago
- Introduction to Control Engineering with Python, revised 2nd edition (Pythonによる制御工学入門 改訂2版) ☆12 Aug 22, 2024 Updated last year
- sc14 matlab application ☆14 Nov 24, 2014 Updated 11 years ago
- code for polite ☆11 Feb 28, 2024 Updated last year
- NeurIPS 2020 Spotlight Paper ☆13 Dec 20, 2021 Updated 4 years ago
- ☆22 Jan 12, 2026 Updated last month
- ☆11 May 13, 2019 Updated 6 years ago
- ☆10 Jul 29, 2022 Updated 3 years ago
- This is the repository holding the code used to perform the analysis used in the manuscript "Machine learning in policy evaluation: new t… ☆11 Sep 12, 2019 Updated 6 years ago
- MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data … ☆11 May 31, 2024 Updated last year
- A small library of 3D related utilities used in my research. ☆10 Mar 5, 2022 Updated 3 years ago
- Incredibly user-friendly seq2seq API and CLI app with beam search, bidirectional, attention, bucket in just one single file ☆12 Sep 16, 2018 Updated 7 years ago
- ☆11 Jul 13, 2018 Updated 7 years ago
- ☆14 Jan 27, 2026 Updated 3 weeks ago
- ☆11 Feb 11, 2024 Updated 2 years ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping ☆10 Sep 11, 2020 Updated 5 years ago
- NLNS+VND Metaheuristic Algorithm for solving Combinatorial Optimization Problems ☆10 Jul 4, 2017 Updated 8 years ago
- MuJoCo model for Blue ☆10 Mar 13, 2020 Updated 5 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments ☆12 Jun 3, 2021 Updated 4 years ago
- ☆10 Sep 23, 2019 Updated 6 years ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation ☆11 Jul 26, 2016 Updated 9 years ago
- Simple implementation of dynamic movement primitives (DMP) in python ☆11 Jun 23, 2013 Updated 12 years ago
- Implementation of Johansson, Fredrik D., Shalit, Uri, and Sontag, David. Learning representations for counterfactual inference - ICML, 20… ☆12 Sep 30, 2020 Updated 5 years ago
- A Repo focusing on Engineering Physics Applications of MLX ☆12 Oct 8, 2024 Updated last year
- PID-like control implemented as active inference with linear generative models ☆11 Jul 2, 2020 Updated 5 years ago
- nimo ☆20 Feb 11, 2026 Updated last week
- ☆10 Jul 27, 2023 Updated 2 years ago
- Official implementation of the ICLR 2021 paper "Differentiable Trust Region Layers for Deep Reinforcement Learning" ☆11 Aug 23, 2023 Updated 2 years ago
- Pixyz Tutorial in RL Architecture Study Group ☆11 Apr 25, 2019 Updated 6 years ago
- Differentiable MPC in Chainer, developed as part of PFN summer internship 2019. ☆15 Aug 23, 2022 Updated 3 years ago
- The source code of our DCB algorithm in KDD17 paper: Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing ☆10 Jun 25, 2018 Updated 7 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL ☆12 Aug 4, 2020 Updated 5 years ago
- Python implementation of the BRIM algorithm for bipartite community structure detection. ☆12 Aug 11, 2022 Updated 3 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning ☆11 Jun 8, 2020 Updated 5 years ago
- ☆11 Aug 27, 2018 Updated 7 years ago