Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12 · Jan 19, 2024 · Updated 2 years ago
Alternatives and similar repositories for SEABO
Users that are interested in SEABO are comparing it to the libraries listed below.
- Code for the paper Imitation Learning from Observation with Automatic Discount Scheduling ☆13 · Mar 27, 2024 · Updated 2 years ago
- ☆18 · Apr 11, 2024 · Updated 2 years ago
- ☆16 · Apr 14, 2026 · Updated 2 weeks ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆30 · Jan 12, 2023 · Updated 3 years ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning ☆15 · Oct 22, 2023 · Updated 2 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm ☆35 · Mar 12, 2020 · Updated 6 years ago
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024) ☆15 · Aug 15, 2025 · Updated 8 months ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting ☆16 · Feb 14, 2024 · Updated 2 years ago
- ☆30 · Oct 3, 2023 · Updated 2 years ago
- [NeuIPS2024 DTQL] Diffusion Trusted Q-Learning for Offline RL: Official PyTorch Implementation ☆24 · May 31, 2024 · Updated last year
- Kolmogorov Arnold Networks trained on MNIST ☆12 · May 4, 2024 · Updated last year
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022. ☆22 · Mar 11, 2022 · Updated 4 years ago
- ICRA 2024 ☆17 · Mar 13, 2024 · Updated 2 years ago
- Symbolic Regression from Scratch with Python ☆14 · Dec 6, 2022 · Updated 3 years ago
- ☆63 · Jan 30, 2026 · Updated 3 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Aug 8, 2022 · Updated 3 years ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24 ☆26 · Aug 28, 2024 · Updated last year
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination" ☆20 · Dec 22, 2021 · Updated 4 years ago
- ☆11 · Mar 15, 2023 · Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 6 years ago
- Experiment utility code, specifically designed for use with Compute Canada. ☆11 · Jan 27, 2025 · Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆62 · Apr 29, 2024 · Updated 2 years ago
- ☆10 · Mar 11, 2024 · Updated 2 years ago
- [AAAI 2023] Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning ☆11 · Apr 29, 2024 · Updated 2 years ago
- Synthetic Experience Replay ☆111 · Apr 16, 2026 · Updated 2 weeks ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆29 · Feb 21, 2022 · Updated 4 years ago
- ☆43 · May 25, 2023 · Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆97 · Dec 1, 2024 · Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆75 · Oct 18, 2022 · Updated 3 years ago
- RLHF for Video Diffusion Models ☆26 · Jul 30, 2025 · Updated 9 months ago
- ☆15 · Jun 1, 2023 · Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- Code for the paper Normalizing Flows are Capable Models for RL ☆19 · Jun 3, 2025 · Updated 11 months ago
- Simple MoE - Day 17 of 365 Days of Repos ☆18 · Apr 21, 2026 · Updated last week
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024) ☆15 · Oct 29, 2024 · Updated last year
- ☆14 · Jul 4, 2022 · Updated 3 years ago
- Anti exploration in offline reinforcement learning ☆11 · May 17, 2021 · Updated 4 years ago
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding ☆62 · Apr 21, 2026 · Updated last week
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024) ☆18 · Jun 18, 2024 · Updated last year