Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
☆50Jun 26, 2024Updated last year
Alternatives and similar repositories for effective-horizon
Users that are interested in effective-horizon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆23Nov 18, 2022Updated 3 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆29Jun 3, 2023Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆34Nov 13, 2023Updated 2 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆20Sep 16, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)☆50Feb 15, 2022Updated 4 years ago
- ☆11Jun 13, 2024Updated last year
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆34Dec 14, 2023Updated 2 years ago
- General Modules for JAX☆73Apr 7, 2026Updated last month
- ☆13Mar 10, 2026Updated 2 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Apr 13, 2026Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU setting☆239Nov 24, 2025Updated 6 months ago
- ☆11Mar 30, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.☆15Mar 22, 2023Updated 3 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 11 months ago
- This repo support auto line plot for multi-seed event file from TensorBoard☆12Jun 23, 2022Updated 3 years ago
- Comparison between GFlowNets & Maximum Entropy RL☆19Feb 19, 2024Updated 2 years ago
- ☆23Nov 11, 2024Updated last year
- Simple tools for statistical analyses in RL experiments☆67Jun 21, 2018Updated 7 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- flexible meta-learning in jax☆16Oct 19, 2023Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆18Nov 16, 2020Updated 5 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆30Mar 16, 2026Updated 2 months ago
- ☆19Apr 22, 2024Updated 2 years ago
- ☆15Apr 5, 2023Updated 3 years ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆29Jul 11, 2024Updated last year
- ☆34Oct 16, 2021Updated 4 years ago
- ☆15Jul 25, 2023Updated 2 years ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 6 months ago
- ☆264Mar 11, 2026Updated 2 months ago
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago