Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.
☆32Nov 12, 2024Updated last year
Alternatives and similar repositories for GTA
Users that are interested in GTA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Mar 11, 2024Updated 2 years ago
- Synthetic Experience Replay☆114Apr 16, 2026Updated 2 months ago
- ☆64Nov 15, 2024Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆55Aug 26, 2023Updated 2 years ago
- ☆39Jul 2, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The collection of my research papers' illustrations.☆22Oct 15, 2023Updated 2 years ago
- ☆20Nov 3, 2024Updated last year
- Learning energy decompositions for partial inference in GFlowNets☆16Jun 4, 2024Updated 2 years ago
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆25Feb 27, 2025Updated last year
- ☆61Feb 3, 2023Updated 3 years ago
- Official repo for arxiv paper "Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion I…☆17Nov 8, 2024Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated 2 years ago
- Q-learning with Adjoint Matching☆96May 11, 2026Updated last month
- QGFN: Controllable Greediness with Action Values - Code☆11May 17, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for our paper "Unlocking Guidance for Discrete State-Space Diffusion and Flow Models"☆34Apr 18, 2025Updated last year
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆23Dec 7, 2024Updated last year
- Official Code for Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization (NIPS 2024)☆23Aug 15, 2024Updated last year
- 📰 [TMLR 2026 Survey Certification] Must-Read Papers on Offline Model-Based Optimization 🔥☆30Jan 27, 2026Updated 5 months ago
- Course Website for "AI618: Generative Model and Unsupervised Learning"☆36May 23, 2023Updated 3 years ago
- A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)☆56Mar 30, 2021Updated 5 years ago
- Towards Foundation Models for Mixed Integer Linear Programming☆17Feb 3, 2025Updated last year
- Official repository for GFACS☆36May 17, 2024Updated 2 years ago
- ☆29Nov 5, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official code repository for the paper Exploring Chemical Space with Score-based Out-of-distribution Generation (ICML 2023)☆40May 19, 2024Updated 2 years ago
- ☆36Jun 7, 2024Updated 2 years ago
- Prioritized Generative Replay (ICLR 2025 Oral)☆29Mar 1, 2025Updated last year
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆12Jul 15, 2023Updated 2 years ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆28Jul 19, 2023Updated 2 years ago
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆66Feb 12, 2025Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Feb 3, 2022Updated 4 years ago
- ☆105May 31, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A list of Offline to Online RL papers (continually updated)☆100Apr 25, 2026Updated 2 months ago
- The official implementation of flow Q-learning (FQL)☆319Jul 21, 2025Updated 11 months ago
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans☆22Jan 31, 2024Updated 2 years ago
- [ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆81Oct 21, 2023Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆91Oct 15, 2023Updated 2 years ago
- ☆371May 1, 2023Updated 3 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆1,363Aug 3, 2023Updated 2 years ago