sebascuri/rllib

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sebascuri/rllib)

sebascuri / rllib

☆20

Alternatives and similar repositories for rllib

Users that are interested in rllib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lasgroup / opax
View on GitHub
☆19Jan 9, 2025Updated last year
sebascuri / hucrl
View on GitHub
☆32Nov 13, 2023Updated 2 years ago
martius-lab / caiac
View on GitHub
Code for the paper: Causal Action Influence Aware Counterfactual Data Augmentation @ICML2024
☆12Jul 19, 2024Updated 2 years ago
andrew-cr / online_var_fil
View on GitHub
Code for our paper: Online Variational Filtering and Parameter Learning
☆20Dec 8, 2021Updated 4 years ago
sisl / MPHRL
View on GitHub
Model Primitive Hierarchical Reinforcement Learning
☆13Dec 8, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
JeppeKlitgaard / barista
View on GitHub
A Python and Jsonnet framework for handling espanso configurations
☆11Oct 6, 2025Updated 9 months ago
schwartenbeckph / Mechanisms_Exploration_Paper
View on GitHub
Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"
☆11May 22, 2020Updated 6 years ago
shenao-zhang / reward-augmented-preference
View on GitHub
The official implementation of Preference Data Reward-Augmentation.
☆18May 1, 2025Updated last year
esennesh / dcpc_paper
View on GitHub
☆12Aug 26, 2025Updated 10 months ago
zehao99 / CEIT
View on GitHub
Python Package for EIT(Electric Impedance Tomography)-like problems using Gauss-Newton method.
☆17Nov 5, 2025Updated 8 months ago
FragileTech / plangym
View on GitHub
Library that provides environments for planning problems
☆17Apr 24, 2026Updated 3 months ago
gavlegoat / safe-learning
View on GitHub
☆18Jul 20, 2023Updated 3 years ago
rajaswa / feedback-and-memory-in-transformers
View on GitHub
My final project submission for the Meta Learning course at BITS Goa (conducted by TCS Research)
☆17May 3, 2021Updated 5 years ago
ronammar / collective_influence
View on GitHub
☆12Oct 13, 2017Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
facebookresearch / controllable_agent
View on GitHub
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…
☆80Jul 17, 2023Updated 3 years ago
stephen-chung-mh / thinker
View on GitHub
Thinker project
☆16Sep 4, 2024Updated last year
agentification / Language-Integrated-VI
View on GitHub
☆21Apr 12, 2024Updated 2 years ago
humans-to-robots-motion / tp-rmp
View on GitHub
Learning Task-parametrized Riemannian Motion Policies from demonstrations.
☆17Dec 23, 2022Updated 3 years ago
is0383kk / Pytorch_VAE-GMM
View on GitHub
Implementation of mutual learning model between VAE and GMM.
☆29Oct 8, 2025Updated 9 months ago
Stanford-ILIAD / ELLA
View on GitHub
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21Mar 9, 2021Updated 5 years ago
HarrieO / RankingComplexLayouts
View on GitHub
Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"
☆16Aug 28, 2018Updated 7 years ago
sisl / BetaZero.jl
View on GitHub
Belief-state planning for POMDPs using learned approximations
☆25Jan 21, 2025Updated last year
NiklasRosenstein / slap
View on GitHub
Slap is a CLI to assist in the process for developing and releasing Python packages.
☆26May 21, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
baymax-lin / Huawei-CodeCraft-2022
View on GitHub
2022华为软件精英挑战赛 - 杭厦赛区 - 土豪法称霸杭厦 - 决赛季军
☆14Jul 31, 2023Updated 2 years ago
MahanFathi / HJxB
View on GitHub
Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)
☆16Feb 1, 2022Updated 4 years ago
Kim-Hammar / gym-optimal-intrusion-response
View on GitHub
A Simulated Optimal Intrusion Response Game
☆21Apr 3, 2022Updated 4 years ago
pronkinnikita / pytorch-pretrained-BERT
View on GitHub
📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…
☆16Jun 9, 2019Updated 7 years ago
tk-rusch / unicornn
View on GitHub
Official code for UnICORNN (ICML 2021)
☆28Oct 1, 2021Updated 4 years ago
senya-ashukha / quantile-regression-dqn-pytorch
View on GitHub
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆97Sep 3, 2020Updated 5 years ago
weberrr / CrossDQN
View on GitHub
☆19Oct 21, 2021Updated 4 years ago
ibrahim-elshar / gym-windy-gridworlds
View on GitHub
Windy GridWorlds environments compatible with OpenAI gym.
☆15Jul 8, 2022Updated 4 years ago
akarshp28 / EIT-EBM
View on GitHub
EIT-EBM
☆22Jun 12, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Akella17 / Deep-Bayesian-Quadrature-Policy-Optimization
View on GitHub
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
☆17Feb 17, 2021Updated 5 years ago
ndawlab / seqanx
View on GitHub
Code associated with "Anxiety, avoidance, and sequential evaluation"
☆17Oct 26, 2021Updated 4 years ago
KamitaniLab / slir
View on GitHub
Python package for Sparse Linear Regression (SLiR)
☆21Jul 16, 2019Updated 7 years ago
IRVLUTD / neuralgrasps-dataset-generation
View on GitHub
Dataset generation for NeuralGrasps https://arxiv.org/abs/2207.02959
☆24Sep 26, 2024Updated last year
dannysdeng / dqn-pytorch
View on GitHub
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Jul 26, 2019Updated 7 years ago
iwatake2222 / InferenceHelper_Sample_ROS
View on GitHub
DNN Node Collection using Inference Helper in ROS2
☆13Apr 24, 2022Updated 4 years ago
google-deepmind / enn_acme
View on GitHub
☆30Aug 25, 2022Updated 3 years ago