Baichenjia/UTDS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Baichenjia/UTDS)

Baichenjia / UTDS

Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL

☆18

Alternatives and similar repositories for UTDS

Users that are interested in UTDS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Improbable-AI / dw-offline-rl
View on GitHub
Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
☆25Jan 29, 2024Updated 2 years ago
mj-hwang / offline-stable-baselines3
View on GitHub
Offline RL algoritms implemented in Stable Baselines3 (pytorch)
☆11Dec 7, 2021Updated 4 years ago
pcuenca / lpips-j
View on GitHub
Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.
☆17Aug 1, 2022Updated 3 years ago
yminchen / rom-mpc-rl
View on GitHub
☆14Sep 26, 2023Updated 2 years ago
yandexdataschool / gumbel_dpg
View on GitHub
Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.
☆12Jun 20, 2017Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
semitable / seps
View on GitHub
Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)
☆10Oct 26, 2021Updated 4 years ago
zhqiu / contrastive-learning-iSogCLR
View on GitHub
☆11Apr 6, 2024Updated 2 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
nissymori / remax-rl
View on GitHub
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15Jul 3, 2026Updated 3 weeks ago
rkcosner / cyberpod_sim_ros
View on GitHub
Segway Simulation Environment
☆11Dec 31, 2020Updated 5 years ago
ualejand / Reinforcement_Learning
View on GitHub
Adaptive PI controller based on a reinforcement learning algorithm for speed control of a DC motor
☆12Oct 5, 2023Updated 2 years ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
minruixu / MAFRL
View on GitHub
code for
☆11Apr 10, 2021Updated 5 years ago
panagelak / 4WD-drive-arduino-code-with-rosserial-encoders-pid
View on GitHub
arduino due code to control a 4 wheeled differential vehicle by a cmd_vel callback using rosserial pid_arduino_library and quadrature_enc…
☆13Aug 3, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
i5lab / Platoon-Simulation
View on GitHub
Compare Laguerre-based MPC and Traditional MPC for platoon of vehicles.
☆13Feb 14, 2023Updated 3 years ago
NathanGavenski / IUPE
View on GitHub
Pytorch official implementation for Imitating Unknown Policies via Exploration.
☆14Oct 3, 2023Updated 2 years ago
Baichenjia / GHER
View on GitHub
G-HER algorithm
☆18May 24, 2019Updated 7 years ago
voot-t / guide-actor-critic
View on GitHub
Keras implementation of guide actor-critic for continuous control
☆11Mar 12, 2018Updated 8 years ago
junhyeokahn / PyPnC
View on GitHub
Python Implementation of Planning and Control
☆60Feb 27, 2024Updated 2 years ago
Chenan-W / Multi-Bebop
View on GitHub
基于ROS的多无人机协同控制
☆12May 8, 2021Updated 5 years ago
nopaddleboat / generate-humanoid-walking-gaits
View on GitHub
This repository shows how to follow the guidelines from Pinocchio to generate walking gaits for a humanoid robot.
☆15Aug 31, 2023Updated 2 years ago
WentDong / Adapt
View on GitHub
Actuator Degeneration Adaptation Transformer
☆14Sep 19, 2023Updated 2 years ago
ValentinaZangirolami / MADRQN
View on GitHub
Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator
☆13Apr 1, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cwjwudi / OmniIsaacGymEnvs-Cassie
View on GitHub
This is a clone of OmniIsaacGymEnvs, and it is used for my rl env test.
☆12May 4, 2023Updated 3 years ago
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
t6-thu / H2Oplus
View on GitHub
[ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
☆13Apr 10, 2025Updated last year
gtfactslab / IROS2020_LearningBarriers
View on GitHub
Synthesis of Control Barrier Functions Using a Supervised Machine Learning Approach
☆13Apr 1, 2020Updated 6 years ago
MAX-GitHub-Z / STM32F1_FOC_FreeRTOS
View on GitHub
学习在STM32F103上运行FreeRTOS的同时运行FOC算法驱动电机
☆13Apr 3, 2023Updated 3 years ago
bikcrum / ppo_transformer
View on GitHub
Implementation of Proximal Policy Optimization using Transformer
☆12Jul 4, 2023Updated 3 years ago
kwanyoungpark / LEQ
View on GitHub
Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning
☆19Feb 6, 2025Updated last year
pr-shukla / Pursuit-Evasion-Reinforcement-Learning
View on GitHub
Application of DDPG on Pursuit-Evasion Problem
☆13Feb 3, 2021Updated 5 years ago
AdroitAnandAI / Topic-Modelling-LDA-NMF-W2V
View on GitHub
Numerical combination of LDA and NMF cascaded with W2V to categorize 1M+ multi-lingual records into a 275-node, 5-level deep category tre…
☆11Aug 29, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
elliottower / gobblet-rl
View on GitHub
Interactive Multi-Agent Reinforcement Learning Environment for the board game Gobblet using PettingZoo.
☆12Jul 2, 2023Updated 3 years ago
rmcong / TNet_TMM2022
View on GitHub
☆15Jan 17, 2023Updated 3 years ago
igsor / HDPy
View on GitHub
Heuristic Dynamic Programming with Python
☆14Jul 28, 2014Updated 12 years ago
XiaotaoGuo / Effective-Cpp-Reading-Note
View on GitHub
Effective C++: 55 Specific Ways to Improve Your Programs and Designs (Ver.3) Reading Notes
☆14Feb 21, 2021Updated 5 years ago
avillaflor / SPLT-transformer
View on GitHub
☆18Jul 10, 2022Updated 4 years ago
RobertTLange / minimal-meta-rl
View on GitHub
Minimal A2C/A3C example of an LSTM-based meta-learner.
☆13Feb 2, 2021Updated 5 years ago
222464 / TeensyAtariPlayingAgent
View on GitHub
An agent for playing Atari games running on a Teensy microcontroller
☆14Nov 11, 2022Updated 3 years ago