Conservative Q learning in Jax
☆57 · Feb 7, 2023 · Updated 3 years ago
Alternatives and similar repositories for JaxCQL
Users that are interested in JaxCQL are comparing it to the libraries listed below.
- Conservative Q Learning on top of SAC ☆138 · Oct 15, 2022 · Updated 3 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023) ☆120 · Jul 31, 2024 · Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆46 · Jul 27, 2023 · Updated 2 years ago
- ☆322 · Jan 23, 2022 · Updated 4 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods. ☆21 · Sep 24, 2025 · Updated 6 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆94 · Dec 1, 2024 · Updated last year
- [NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning ☆16 · Oct 29, 2025 · Updated 4 months ago
- ☆53 · Jan 20, 2023 · Updated 3 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan … ☆72 · Feb 2, 2023 · Updated 3 years ago
- ☆18 · Mar 18, 2026 · Updated last week
- ☆23 · Aug 19, 2022 · Updated 3 years ago
- ☆10 · Mar 11, 2024 · Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆80 · Aug 14, 2022 · Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆112 · May 27, 2024 · Updated last year
- Repo for Implicit Diffusion Q-Learning ☆123 · Dec 5, 2023 · Updated 2 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆32 · Oct 26, 2022 · Updated 3 years ago
- ☆57 · Feb 8, 2025 · Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆400 · Dec 18, 2021 · Updated 4 years ago
- Corax: Core RL in JAX ☆41 · Feb 22, 2024 · Updated 2 years ago
- Synthetic Experience Replay ☆110 · May 27, 2024 · Updated last year
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces. ☆753 · Oct 26, 2022 · Updated 3 years ago
- ☆15 · Jan 18, 2026 · Updated 2 months ago
- The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023) ☆44 · Mar 6, 2023 · Updated 3 years ago
- An elegant PyTorch offline reinforcement learning library for researchers. ☆386 · Jul 11, 2025 · Updated 8 months ago
- ☆80 · Dec 9, 2022 · Updated 3 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆53 · Oct 18, 2021 · Updated 4 years ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆89 · Nov 19, 2023 · Updated 2 years ago
- A benchmark for offline goal-conditioned RL and offline RL ☆347 · Jan 14, 2026 · Updated 2 months ago
- ☆19 · Jun 25, 2023 · Updated 2 years ago
- [NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression ☆14 · Oct 29, 2025 · Updated 4 months ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data". ☆25 · Dec 5, 2023 · Updated 2 years ago
- Code for the paper Normalizing Flows are Capable Models for RL ☆19 · Jun 3, 2025 · Updated 9 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆23 · Apr 17, 2024 · Updated last year
- Anti exploration in offline reinforcement learning ☆11 · May 17, 2021 · Updated 4 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 · Feb 20, 2026 · Updated last month
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning ☆16 · Feb 6, 2025 · Updated last year
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning ☆36 · Dec 30, 2024 · Updated last year
- A simple and easy to use implementation of the soft actor-critic algorithm. ☆15 · Sep 2, 2022 · Updated 3 years ago
- A PyTorch implementation of Implicit Q-Learning ☆97 · Oct 23, 2021 · Updated 4 years ago