☆330Jan 23, 2022Updated 4 years ago
Alternatives and similar repositories for implicit_q_learning
Users that are interested in implicit_q_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆409Dec 18, 2021Updated 4 years ago
- Conservative Q learning in Jax☆58Feb 7, 2023Updated 3 years ago
- Code for conservative Q-learning☆486Dec 7, 2021Updated 4 years ago
- Conservative Q Learning on top of SAC☆139Oct 15, 2022Updated 3 years ago
- A PyTorch implementation of Implicit Q-Learning☆99Oct 23, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆756Oct 26, 2022Updated 3 years ago
- A collection of reference environments for offline reinforcement learning☆1,693Nov 18, 2024Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆98Dec 1, 2024Updated last year
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆1,363Aug 3, 2023Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Jul 27, 2023Updated 2 years ago
- Repo for Implicit Diffusion Q-Learning☆125Dec 5, 2023Updated 2 years ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,067May 23, 2024Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆18Oct 21, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 4 years ago
- Collection of reinforcement learning algorithms☆2,912Jun 17, 2024Updated 2 years ago
- ☆44Sep 19, 2021Updated 4 years ago
- Corax: Core RL in JAX☆42Feb 22, 2024Updated 2 years ago
- ☆58Jan 20, 2023Updated 3 years ago
- ☆405Feb 13, 2023Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆80Aug 14, 2022Updated 3 years ago
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆518Updated this week
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆559Jun 26, 2023Updated 3 years ago
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.☆2,820Apr 29, 2024Updated 2 years ago
- ☆80Dec 9, 2022Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- ☆364Oct 12, 2022Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆168Oct 15, 2023Updated 2 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆665Apr 6, 2021Updated 5 years ago
- ☆19Jun 25, 2023Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Aug 8, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Extreme Q-Learning: Max Entropy RL without Entropy☆88Feb 14, 2023Updated 3 years ago
- An offline deep reinforcement learning library☆1,667Sep 10, 2025Updated 9 months ago
- ☆203Mar 25, 2023Updated 3 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆30Jan 12, 2023Updated 3 years ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆872Aug 12, 2024Updated last year
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago