The Controllable Agent project trains RL agents that can optimize any reward function specified in real time, without further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.
☆77 · Jul 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for controllable_agent
Users who are interested in controllable_agent are comparing it to the libraries listed below.
- ☆59 · Jun 6, 2023 · Updated 3 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆23 · Jun 16, 2024 · Updated last year
- ☆15 · Dec 14, 2024 · Updated last year
- Learning diverse options through the Laplacian representation. ☆23 · Jan 5, 2024 · Updated 2 years ago
- Contextual Bandit Spectral Representation Learner ☆12 · Oct 25, 2022 · Updated 3 years ago
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus) ☆13 · Oct 30, 2023 · Updated 2 years ago
- [ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps ☆13 · Apr 10, 2025 · Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆89 · Nov 19, 2023 · Updated 2 years ago
- ☆364 · Oct 12, 2022 · Updated 3 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation". ☆24 · Nov 8, 2024 · Updated last year
- Code for the paper: Causal Action Influence Aware Counterfactual Data Augmentation @ICML2024 ☆12 · Jul 19, 2024 · Updated last year
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024) ☆29 · Jan 14, 2025 · Updated last year
- A tool for recording RL trajectories. ☆119 · Updated this week
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆21 · Dec 16, 2018 · Updated 7 years ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023 ☆58 · May 19, 2025 · Updated last year
- Learning bisimulation metrics for control, particularly suited to sparse reward settings ☆11 · Feb 28, 2023 · Updated 3 years ago
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces ☆18 · Dec 18, 2024 · Updated last year
- ☆35 · Jan 4, 2023 · Updated 3 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023) ☆16 · Mar 3, 2023 · Updated 3 years ago
- ☆17 · May 25, 2023 · Updated 3 years ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022) ☆25 · Mar 4, 2022 · Updated 4 years ago
- [AAMAS'26] xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing ☆26 · Jan 8, 2026 · Updated 5 months ago
- Least Squares Policy Iteration (LSPI) in Python ☆11 · May 25, 2015 · Updated 11 years ago
- [ICML 2025] The Official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning" ☆30 · Dec 15, 2025 · Updated 6 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆104 · Sep 29, 2025 · Updated 8 months ago
- ☆23 · Nov 3, 2023 · Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆35 · Feb 9, 2021 · Updated 5 years ago
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself" ☆139 · Aug 15, 2023 · Updated 2 years ago
- A variant of Varibad that is robust to difficult tasks ☆11 · Aug 30, 2023 · Updated 2 years ago
- official implementation of ODICE ☆19 · Jan 31, 2024 · Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆29 · Jun 3, 2023 · Updated 3 years ago
- ☆20 · Nov 13, 2023 · Updated 2 years ago
- ☆26 · May 12, 2025 · Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆91 · Oct 15, 2023 · Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight. ☆269 · Jun 6, 2026 · Updated last week
- Separating value functions across time-scales. ☆17 · May 13, 2019 · Updated 7 years ago
- Code for the paper "Learning to Assist Humans without Inferring Rewards" ☆20 · Jul 7, 2024 · Updated last year
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL. ☆36 · Jan 12, 2024 · Updated 2 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆30 · Jan 12, 2023 · Updated 3 years ago