Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
☆50 · Jun 26, 2024 · Updated last year
Alternatives and similar repositories for effective-horizon
Users that are interested in effective-horizon are comparing it to the libraries listed below.
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆10 · Feb 27, 2024 · Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022) ☆22 · Nov 18, 2022 · Updated 3 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆28 · Jun 3, 2023 · Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆34 · Nov 13, 2023 · Updated 2 years ago
- Image-based gridworld experiment for learning Markov state abstractions ☆21 · Sep 16, 2024 · Updated last year
- ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021) ☆50 · Feb 15, 2022 · Updated 4 years ago
- Code for magnetic mirror descent. ☆18 · Oct 5, 2023 · Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆34 · Dec 14, 2023 · Updated 2 years ago
- General Modules for JAX ☆73 · Apr 7, 2026 · Updated last month
- Code for the paper "Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function" ☆13 · Apr 13, 2026 · Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆239 · Nov 24, 2025 · Updated 5 months ago
- ☆11 · Mar 30, 2020 · Updated 6 years ago
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search. ☆15 · Mar 22, 2023 · Updated 3 years ago
- This repo supports automatic line plots for multi-seed event files from TensorBoard ☆12 · Jun 23, 2022 · Updated 3 years ago
- Comparison between GFlowNets & Maximum Entropy RL ☆19 · Feb 19, 2024 · Updated 2 years ago
- Simple tools for statistical analyses in RL experiments ☆67 · Jun 21, 2018 · Updated 7 years ago
- ☆23 · Nov 11, 2024 · Updated last year
- flexible meta-learning in jax ☆16 · Oct 19, 2023 · Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆55 · Jul 27, 2021 · Updated 4 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 · Oct 6, 2021 · Updated 4 years ago
- ☆18 · Nov 16, 2020 · Updated 5 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆21 · Mar 9, 2021 · Updated 5 years ago
- ☆19 · Apr 22, 2024 · Updated 2 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient ☆30 · Mar 16, 2026 · Updated last month
- ☆15 · Apr 5, 2023 · Updated 3 years ago
- A collection of meta-learning algorithms in Jax ☆25 · Sep 3, 2022 · Updated 3 years ago
- This repo is the implementation of the paper "SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning". ☆51 · Dec 4, 2023 · Updated 2 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making" ☆29 · Jul 11, 2024 · Updated last year
- ☆34 · Oct 16, 2021 · Updated 4 years ago
- ☆15 · Jul 25, 2023 · Updated 2 years ago
- Flax Implementation of DreamerV3 on Crafter ☆18 · Nov 29, 2025 · Updated 5 months ago
- ☆262 · Mar 11, 2026 · Updated last month
- [deprecated] Engine Agnostic Gym Environment for Robotics ☆17 · Feb 10, 2022 · Updated 4 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆65 · Mar 24, 2023 · Updated 3 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆26 · Feb 3, 2022 · Updated 4 years ago
- DMControl Generalization Benchmark ☆189 · Jan 3, 2024 · Updated 2 years ago
- ☆18 · Sep 7, 2023 · Updated 2 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation". ☆24 · Nov 8, 2024 · Updated last year
- Reinforcement learning for an AirSim quadrotor implemented in Unity ☆12 · Sep 22, 2021 · Updated 4 years ago