zt95/infinite-horizon-off-policy-estimation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zt95/infinite-horizon-off-policy-estimation)

zt95 / infinite-horizon-off-policy-estimation

☆13

Alternatives and similar repositories for infinite-horizon-off-policy-estimation

Users that are interested in infinite-horizon-off-policy-estimation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

atlarge-research / opendc-simulator
View on GitHub
Datacenter simulation toolkit for the OpenDC project
☆10Aug 24, 2020Updated 5 years ago
ahsan-rahim / bai-fyp
View on GitHub
Breast Cancer Detection using Mask-rcnn on the inbreast dataset
☆13Dec 13, 2023Updated 2 years ago
aijunbai / hplanning
View on GitHub
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
☆11Jul 26, 2016Updated 10 years ago
DorianKodelja / DeepMind-Atari-Deep-Q-Learner-2Player
View on GitHub
☆13Nov 17, 2015Updated 10 years ago
stratisMarkou / sample-efficient-bayesian-rl
View on GitHub
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
☆25Apr 14, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
skypea / DAG_No_Fear
View on GitHub
NeurIPS 2020 Spotlight Paper
☆13Dec 20, 2021Updated 4 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
NVlabs / sim-parameter-estimation
View on GitHub
The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro…
☆30Nov 20, 2020Updated 5 years ago
neale / avoiding-side-effects
View on GitHub
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Jun 3, 2021Updated 5 years ago
montrealrobotics / unsupervised-adr
View on GitHub
Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL
☆12Aug 4, 2020Updated 5 years ago
veronicachelu / temporal_abstraction
View on GitHub
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24Nov 29, 2018Updated 7 years ago
krrish94 / sniffle-workshop
View on GitHub
An easy-to-use jekyll theme for creating a workshop webpage (useful for AI / ML / CV / robotics folks)
☆28Jan 3, 2021Updated 5 years ago
ethanhe42 / Continuous-Energy-Minimization-for-Multitarget-Tracking
View on GitHub
Continuous Energy Minimization for Multitarget Tracking
☆20Feb 9, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
chanind / causal-tracer
View on GitHub
Causal tracing for language models
☆12Apr 2, 2024Updated 2 years ago
Luonic / tf-cnn-lstm-ocr-captcha
View on GitHub
Code for training LSTM neural network on top of convolutional features for captcha recognition in Moscow subway
☆11Aug 8, 2017Updated 8 years ago
Declancharrison / Level-Set-Boosting
View on GitHub
☆10Jul 27, 2023Updated 3 years ago
kpot / kerl
View on GitHub
KERL: reinforcement learning algorithms and tools implemented using Keras
☆11Aug 2, 2024Updated last year
ewanlee / ICLR2019-RL-Papers
View on GitHub
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47May 28, 2019Updated 7 years ago
caus-am / aci
View on GitHub
Ancestral Causal Inference (ACI)
☆14May 24, 2017Updated 9 years ago
Riashat / Bayesian-Exploration-Deep-RL
View on GitHub
Bayesian Uncertainty Exploration in Deep Reinforcement Learning
☆18Jul 12, 2017Updated 9 years ago
rucsgss / thesis
View on GitHub
LaTeX template for Rutgers University Computer Science thesis
☆23Nov 10, 2019Updated 6 years ago
SharathRaparthy / research-readings
View on GitHub
A personal project where I publish my research paper notes on a weekly basis.
☆13Jul 28, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
choo8 / Tensorflow-DeepMind-Atari-Deep-Q-Learner-2Player
View on GitHub
A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow
☆15Apr 27, 2018Updated 8 years ago
atupal / seismicCompetition
View on GitHub
sc14 matlab application
☆14Nov 24, 2014Updated 11 years ago
sorrge / ChatGPT_production
View on GitHub
Small projects made with ChatGPT
☆16Apr 15, 2024Updated 2 years ago
ankitkv / TD-VAE
View on GitHub
TD-VAE in PyTorch
☆10May 28, 2019Updated 7 years ago
antoninschrab / mmdfuse
View on GitHub
MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data …
☆12May 31, 2024Updated 2 years ago
agramfort / DS3_practical_optim_for_ml
View on GitHub
Notebooks from DS3 course on practical optimization
☆15Jan 5, 2021Updated 5 years ago
SharonBrizinov / PickTime
View on GitHub
PickTime Chrome Extension - extract myvisit tokens and send to PickTime bot
☆13May 16, 2022Updated 4 years ago
jqhoogland / obsidian-squiggle
View on GitHub
Obsidian Plugin to execute squiggle in a note.
☆26Sep 25, 2022Updated 3 years ago
fishmoon1234 / Nonlocal-Attention-Operator
View on GitHub
Attention mechanism-based neural operator models to solve both forward and inverse problems.
☆17May 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ChunyuanLI / RAS
View on GitHub
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15Jan 21, 2019Updated 7 years ago
illidanlab / cdrl
View on GitHub
Collaborative Deep Reinforcement Learning
☆32Jul 29, 2017Updated 9 years ago
davidrpugh / pyCollocation
View on GitHub
Python package for solving initial value problems (IVP) and two-point boundary value problems (2PBVP).
☆16Jul 20, 2016Updated 10 years ago
WalterBabyRudin / Courseware
View on GitHub
☆11Jan 12, 2021Updated 5 years ago
aosewski / RidgeDetection
View on GitHub
Parallel implementation of the ridge detection algorithm for curve reconstruction in CUDA
☆13Nov 21, 2017Updated 8 years ago
georgekatona / Clique
View on GitHub
Python implementation of the CLIQUE subspace clustering algorithm.
☆55Jul 6, 2023Updated 3 years ago
IliasZadik / double_orthogonal_ml
View on GitHub
☆10Jul 13, 2018Updated 8 years ago