gauthamvasan / avgLinks
Action Value Gradient Algorithm
☆26Updated 6 months ago
Alternatives and similar repositories for avg
Users that are interested in avg are comparing it to the libraries listed below
Sorting:
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆79Updated last month
- ☆27Updated last month
- Corax: Core RL in JAX☆38Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆71Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆81Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆34Updated last year
- ☆112Updated 9 months ago
- ☆30Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)☆102Updated 2 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- The official implementation of "Horizon Reduction Makes RL Scalable"☆169Updated 4 months ago
- ☆23Updated last year
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆77Updated last year
- ☆46Updated 2 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Repo for Implicit Diffusion Q-Learning☆117Updated 2 years ago
- ☆32Updated last year
- ☆35Updated 6 months ago
- PWM: Policy Learning with Large World Models☆60Updated 4 months ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 4 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆80Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆91Updated last year
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆21Updated 10 months ago
- ☆10Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆128Updated 5 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- ☆51Updated 2 years ago
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)☆42Updated 9 months ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆15Updated 2 years ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆171Updated last month