facebookresearch/controllable_agent

The Controllable Agent project trains RL agents that can optimize any reward function specified in real time, with no further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.

☆80

Alternatives and similar repositories for controllable_agent

Users who are interested in controllable_agent are comparing it to the libraries listed below.

ahmed-touati / controllable_agent
☆61 · Jun 6, 2023 · Updated 3 years ago
adaptive-intelligent-robotics / QDAC
Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …
☆24 · Jun 16, 2024 · Updated 2 years ago
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29 · Jan 14, 2025 · Updated last year
mklissa / dceo
Learning diverse options through the Laplacian representation.
☆23 · Jan 5, 2024 · Updated 2 years ago
facebookresearch / xbanditsrl
Contextual Bandit Spectral Representation Learner
☆13 · Oct 25, 2022 · Updated 3 years ago
upiterbarg / hihack
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
☆13 · Oct 30, 2023 · Updated 2 years ago
t6-thu / H2Oplus
[ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
☆13 · Apr 10, 2025 · Updated last year
dibyaghosh / icvf_release
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
☆89 · Nov 19, 2023 · Updated 2 years ago
rll-research / url_benchmark
☆367 · Oct 12, 2022 · Updated 3 years ago
JesseFarebro / distributional-sr
Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".
☆23 · Nov 8, 2024 · Updated last year
martius-lab / caiac
Code for the paper: Causal Action Influence Aware Counterfactual Data Augmentation @ICML2024
☆12 · Jul 19, 2024 · Updated 2 years ago
imoneoi / RSP_JAX
[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?
☆15 · Dec 10, 2024 · Updated last year
Princeton-RL / contrastive-successor-features
☆17 · Dec 14, 2024 · Updated last year
arushijain94 / SafeOptionCritic
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆21 · Dec 16, 2018 · Updated 7 years ago
quasimetric-learning / quasimetric-rl
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
☆61 · May 19, 2025 · Updated last year
google-deepmind / envlogger
A tool for recording RL trajectories.
☆123 · Updated this week
tseyde / decqn
☆35 · Jan 4, 2023 · Updated 3 years ago
tarod13 / laplacian_dual_dynamics
Dual optimization to learn laplacian eigenpairs in arbitrary spaces
☆18 · Dec 18, 2024 · Updated last year
Facebear-ljx / RGM
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
☆16 · Mar 3, 2023 · Updated 3 years ago
Facebear-ljx / PROTO
☆17 · May 25, 2023 · Updated 3 years ago
t6-thu / xTED
[AAMAS'26] xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
☆26 · Jan 8, 2026 · Updated 6 months ago
tristandeleu / jax-comln
Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)
☆25 · Mar 4, 2022 · Updated 4 years ago
rhololkeolke / lspi-python
Least Squares Policy Iteration (LSPI) in Python
☆11 · May 25, 2015 · Updated 11 years ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆104 · Sep 29, 2025 · Updated 9 months ago
MichalBortkiewicz / JaxGCRL
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆273 · Jun 6, 2026 · Updated last month
facebookresearch / denoised_mdp
Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"
☆140 · Aug 15, 2023 · Updated 2 years ago
ido90 / RobustMetaRL
A variant of Varibad that is robust to difficult tasks
☆11 · Aug 30, 2023 · Updated 2 years ago
nuria95 / O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
☆36 · Feb 9, 2021 · Updated 5 years ago
sinaghiassian / OffpolicyAlgorithms
☆23 · Nov 9, 2021 · Updated 4 years ago
maoliyuan / ODICE-Pytorch
Official implementation of ODICE
☆19 · Jan 31, 2024 · Updated 2 years ago
seohongpark / CSD-locomotion
Controllability-Aware Unsupervised Skill Discovery (ICML 2023)
☆30 · Jun 3, 2023 · Updated 3 years ago
pcheng2 / TSRL
☆23 · Nov 3, 2023 · Updated 2 years ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆92 · Oct 15, 2023 · Updated 2 years ago
facebookresearch / td-delta
Separating value functions across time-scales.
☆18 · May 13, 2019 · Updated 7 years ago
sebascuri / rllib
☆20 · Nov 13, 2023 · Updated 2 years ago
kvfrans / jaxtransformer
Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc.
☆16 · May 28, 2025 · Updated last year
MichaelTMatthews / Craftax_Baselines
☆28 · Jun 16, 2026 · Updated last month
facebookresearch / agenthive
AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.
☆36 · Jan 12, 2024 · Updated 2 years ago
JasonMa2016 / SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…
☆30 · Jan 12, 2023 · Updated 3 years ago