☆13 Apr 3, 2019 Updated 6 years ago
Alternatives and similar repositories for infinite-horizon-off-policy-estimation
Users that are interested in infinite-horizon-off-policy-estimation are comparing it to the libraries listed below.
- Datacenter simulation toolkit for the OpenDC project ☆10 Aug 24, 2020 Updated 5 years ago
- Breast Cancer Detection using Mask-rcnn on the inbreast dataset ☆13 Dec 13, 2023 Updated 2 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL ☆24 Apr 14, 2022 Updated 3 years ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation ☆11 Jul 26, 2016 Updated 9 years ago
- ☆13 Nov 17, 2015 Updated 10 years ago
- NeurIPS 2020 Spotlight Paper ☆13 Dec 20, 2021 Updated 4 years ago
- Exploration by Random Network Distillation ☆15 Dec 30, 2018 Updated 7 years ago
- Model-Free-Episodic-Control implementation. ☆17 Jun 3, 2019 Updated 6 years ago
- The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro… ☆31 Nov 20, 2020 Updated 5 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments ☆12 Jun 3, 2021 Updated 4 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL ☆12 Aug 4, 2020 Updated 5 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space… ☆24 Nov 29, 2018 Updated 7 years ago
- An easy-to-use jekyll theme for creating a workshop webpage (useful for AI / ML / CV / robotics folks) ☆28 Jan 3, 2021 Updated 5 years ago
- Continuous Energy Minimization for Multitarget Tracking ☆20 Feb 9, 2022 Updated 4 years ago
- ICU-Sepsis is a lightweight, yet challenging RL environment that models the treatment of sepsis in the ICU. ☆40 Oct 23, 2024 Updated last year
- Code for training LSTM neural network on top of convolutional features for captcha recognition in Moscow subway ☆11 Aug 8, 2017 Updated 8 years ago
- Causal tracing for language models ☆12 Apr 2, 2024 Updated last year
- KERL: reinforcement learning algorithms and tools implemented using Keras ☆11 Aug 2, 2024 Updated last year
- ☆19 Oct 30, 2025 Updated 5 months ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 May 28, 2019 Updated 6 years ago
- Attention mechanism-based neural operator models to solve both forward and inverse problems. ☆16 May 30, 2025 Updated 10 months ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 Jul 12, 2017 Updated 8 years ago
- LaTeX template for Rutgers University Computer Science thesis ☆23 Nov 10, 2019 Updated 6 years ago
- ☆10 Jul 27, 2023 Updated 2 years ago
- A personal project where I publish my research paper notes on a weekly basis. ☆13 Jul 28, 2021 Updated 4 years ago
- sc14 matlab application ☆14 Nov 24, 2014 Updated 11 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow ☆15 Apr 27, 2018 Updated 7 years ago
- Small projects made with ChatGPT ☆16 Apr 15, 2024 Updated last year
- TD-VAE in PyTorch ☆10 May 28, 2019 Updated 6 years ago
- MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data … ☆11 May 31, 2024 Updated last year
- Notebooks from DS3 course on practical optimization ☆15 Jan 5, 2021 Updated 5 years ago
- Ancestral Causal Inference (ACI) ☆14 May 24, 2017 Updated 8 years ago
- PickTime Chrome Extension - extract myvisit tokens and send to PickTime bot ☆13 May 16, 2022 Updated 3 years ago
- Obsidian Plugin to execute squiggle in a note. ☆26 Sep 25, 2022 Updated 3 years ago
- Python package for solving initial value problems (IVP) and two-point boundary value problems (2PBVP). ☆16 Jul 20, 2016 Updated 9 years ago
- Collaborative Deep Reinforcement Learning ☆32 Jul 29, 2017 Updated 8 years ago
- Parallel implementation of the ridge detection algorithm for curve reconstruction in CUDA ☆12 Nov 21, 2017 Updated 8 years ago
- Python implementation of the CLIQUE subspace clustering algorithm. ☆55 Jul 6, 2023 Updated 2 years ago
- ☆11 Jul 13, 2018 Updated 7 years ago