dmksjfl/SEABO

Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning

☆12

Alternatives and similar repositories for SEABO

Users that are interested in SEABO are comparing it to the libraries listed below.

dwjshift / IL_ADS
code for the paper Imitation Learning from Observation with Automatic Discount Scheduling
☆13 · Mar 27, 2024 · Updated 2 years ago
ethanluoyc / optimal_transport_reward
☆18 · Apr 11, 2024 · Updated 2 years ago
pcchenxi / LAPO-offlienRL
☆16 · Apr 14, 2026 · Updated 3 months ago
JasonMa2016 / SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…
☆30 · Jan 12, 2023 · Updated 3 years ago
hari-sikchi / DVL
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16 · Oct 22, 2023 · Updated 2 years ago
tesslerc / TD3-JAX
A JAX Implementation of the Twin Delayed DDPG Algorithm
☆35 · Mar 12, 2020 · Updated 6 years ago
dmksjfl / PAR
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
☆15 · Aug 15, 2025 · Updated 11 months ago
Improbable-AI / harness-offline-rl
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16 · Feb 14, 2024 · Updated 2 years ago
d5rlbenchmark / d5rl
☆31 · Oct 3, 2023 · Updated 2 years ago
TianyuCodings / Diffusion_Trusted_Q_Learning
[NeuIPS2024 DTQL] Diffusion Trusted Q-Learning for Offline RL — Official PyTorch Implementation
☆27 · May 31, 2024 · Updated 2 years ago
DongsuLeeTech / AD4RL
ICRA 2024
☆18 · Mar 13, 2024 · Updated 2 years ago
ale93111 / pykan_mnist
Kolmogorov Arnold Networks trained on MNIST
☆12 · May 4, 2024 · Updated 2 years ago
dmksjfl / DARC
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
☆22 · Mar 11, 2022 · Updated 4 years ago
datarobot-community / symbolic-regression-python
Symbolic Regression from Scratch with Python
☆14 · Dec 6, 2022 · Updated 3 years ago
OffDynamicsRL / off-dynamics-rl
☆65 · Jan 30, 2026 · Updated 5 months ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
quantumiracle / Consistency_Model_For_Reinforcement_Learning
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24
☆27 · Aug 28, 2024 · Updated last year
wenzhe-li / romi
Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"
☆20 · Dec 22, 2021 · Updated 4 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64 · Apr 29, 2024 · Updated 2 years ago
andnp / PyExpUtils
Experiment utility code, specifically designed for use with Compute Canada.
☆11 · Jan 27, 2025 · Updated last year
NHirose / ExAug
☆11 · Mar 15, 2023 · Updated 3 years ago
yunke-wang / UID
[AAAI 2023] Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
☆11 · Apr 29, 2024 · Updated 2 years ago
Baichenjia / PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29 · Feb 21, 2022 · Updated 4 years ago
Shuyan98 / BTH
This repo holds the Pytorch codes and models for the BTH framework presented on CVPR 2021
☆33 · Jun 10, 2021 · Updated 5 years ago
polixir / morec
☆10 · Mar 11, 2024 · Updated 2 years ago
conglu1997 / SynthER
Synthetic Experience Replay
☆114 · Apr 16, 2026 · Updated 3 months ago
jhejna / inverse-preference-learning
☆43 · May 25, 2023 · Updated 3 years ago
YiqinYang / ICQ
Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…
☆76 · Oct 18, 2022 · Updated 3 years ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98 · Dec 1, 2024 · Updated last year
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆32 · Sep 7, 2021 · Updated 4 years ago
shidilrzf / Anti-exploration-RL
Anti exploration in offline reinforcement learning
☆11 · May 17, 2021 · Updated 5 years ago
chiha8888 / NCTU-ML-class
☆17 · Jul 6, 2020 · Updated 6 years ago
Alestaubin / stable-imitation-policy-with-waypoints
Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i…
☆14 · Oct 12, 2024 · Updated last year
saizhang0218 / TMC
Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"
☆27 · Dec 6, 2020 · Updated 5 years ago
ryanxhr / DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆36 · Jan 5, 2023 · Updated 3 years ago
Asap7772 / OfflineRlWorkflow
This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL
☆13 · Nov 5, 2021 · Updated 4 years ago
osilab-kaist / smac_exp
An open source benchmark for Multi Agent Reinforcement Learning
☆31 · Jul 15, 2023 · Updated 3 years ago
henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆84 · May 2, 2022 · Updated 4 years ago