RonanFR/UCRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RonanFR/UCRL)

RonanFR / UCRL

☆27

Alternatives and similar repositories for UCRL

Users that are interested in UCRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
MeckyWu / subspace-match
View on GitHub
☆16Oct 26, 2018Updated 7 years ago
Neo-X / SMiRL_Code
View on GitHub
☆20Nov 13, 2022Updated 3 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago
yoonholee / reinforcement-learning-papers
View on GitHub
My notes on reinforcement learning papers
☆15Jun 14, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zzyunzhi / asynch-mb
View on GitHub
(CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning
☆14Dec 27, 2022Updated 3 years ago
mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
manantomar / Mirror-Descent-Policy-Optimization
View on GitHub
Mirror Descent Policy Optimization
☆43Oct 31, 2020Updated 5 years ago
paris-saclay-cds / python-workshop
View on GitHub
Materials for the Paris-Saclay Center for Data Science python workshop
☆17Jul 6, 2017Updated 9 years ago
bonetblai / qnp2fond
View on GitHub
Qualitative Numeric Planning
☆10Dec 10, 2020Updated 5 years ago
xuedong / machine-learning-summer-schools
View on GitHub
Curated materials for different machine learning related summer schools
☆19Mar 8, 2021Updated 5 years ago
aijunbai / hplanning
View on GitHub
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
☆11Jul 26, 2016Updated 9 years ago
DorianKodelja / DeepMind-Atari-Deep-Q-Learner-2Player
View on GitHub
☆13Nov 17, 2015Updated 10 years ago
Shallow-Updates-for-Deep-RL / Shallow_Updates_for_Deep_RL
View on GitHub
Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"
☆18Nov 2, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
AutumnWu / Streamlined-Off-Policy-Learning
View on GitHub
ICRL 2020
☆20Feb 18, 2020Updated 6 years ago
alexrutar / banditvis
View on GitHub
A Python 3 Bandit Visualization Package
☆11Oct 16, 2017Updated 8 years ago
anniesch / single-life-rl
View on GitHub
Single-Life Reinforcement Learning
☆14Dec 17, 2022Updated 3 years ago
mangaki / movielens
View on GitHub
Système de recommandation minimal sur Movielens (pour Girls Can Code! 2016)
☆16May 26, 2025Updated last year
zt95 / infinite-horizon-off-policy-estimation
View on GitHub
☆13Apr 3, 2019Updated 7 years ago
ArnaudFickinger / adversarial-surprise
View on GitHub
Explore and Control with Adversarial Surprise
☆10Jul 20, 2021Updated 5 years ago
rlseminar / rlseminar.github.io
View on GitHub
Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.
☆21Nov 17, 2023Updated 2 years ago
deerishi / Tic-Tac-Toe-Using-Alpha-Beta-Minimax-Search
View on GitHub
This code demonstrates the use of Alpha Beta Pruning for Game playing. Since, Tic Tac Toe has a depth of 9 , I use a heuristic function t…
☆11Mar 31, 2016Updated 10 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
xuanlinli17 / iclr2021_rlreg
View on GitHub
Regularization Matters in Policy Optimization
☆21Nov 1, 2021Updated 4 years ago
CyberAgentAILab / regularized-bon
View on GitHub
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14Apr 4, 2025Updated last year
strumswell / twitter-follower-graph
View on GitHub
Twitter follower graphs of @Die_Gruenen & @AfD, including cluster and topic analysis
☆10Jul 10, 2020Updated 6 years ago
ying-wen / gr2
View on GitHub
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆14Dec 8, 2022Updated 3 years ago
yfletberliac / rlss-2019
View on GitHub
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
☆90Aug 21, 2019Updated 6 years ago
jilljenn / business-card
View on GitHub
A business card in LaTeX
☆29Feb 11, 2017Updated 9 years ago
google-research / policy-learning-landscape
View on GitHub
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Jan 16, 2019Updated 7 years ago
jxu43 / replication-mbpo
View on GitHub
NeurIPS Reproducibility Challenge 2019
☆21Feb 25, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
tor / libbandit
View on GitHub
Library for Multi-Armed Bandit Algorithms
☆57Apr 2, 2017Updated 9 years ago
Santara / stochastic_value_gradient
View on GitHub
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆25Jan 15, 2022Updated 4 years ago
CatherineMeng / FGYM-user-demo
View on GitHub
Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning
☆14Aug 12, 2021Updated 4 years ago
RobRomijnders / VAE_rec
View on GitHub
Variational Recurrent Auto Encoder
☆15Jul 10, 2016Updated 10 years ago
vaaaaanquish / docker-UTH-BERT
View on GitHub
docker for UTH-BERT: https://ai-health.m.u-tokyo.ac.jp/uth-bert
☆14Mar 24, 2023Updated 3 years ago
veronicachelu / temporal_abstraction
View on GitHub
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24Nov 29, 2018Updated 7 years ago