apple/ml-uwac

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apple/ml-uwac)

apple / ml-uwac

☆35

Alternatives and similar repositories for ml-uwac

Users that are interested in ml-uwac are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yudasong / HyQ
View on GitHub
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
☆24Feb 16, 2023Updated 3 years ago
suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
ryanxhr / DWBC
View on GitHub
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆35Jan 5, 2023Updated 3 years ago
snu-mllab / EDAC
View on GitHub
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆80Aug 14, 2022Updated 3 years ago
robintyh1 / icml2021-pengqlambda
View on GitHub
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15Jul 23, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
asonabend / ESRL
View on GitHub
Code for Expert Supervised Reinforcement Learning
☆10Apr 7, 2021Updated 5 years ago
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
ying-wen / gr2
View on GitHub
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆14Dec 8, 2022Updated 3 years ago
google-research / deep_ope
View on GitHub
☆88Jul 30, 2024Updated last year
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
ajgupta93 / d4pg-pytorch
View on GitHub
In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.
☆19Jun 15, 2018Updated 8 years ago
tgangwani / GuidanceRewards
View on GitHub
Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
☆12Jul 7, 2021Updated 5 years ago
mklissa / phi_gcn
View on GitHub
Reward Propagation using Graph Convolutional Networks
☆13Jun 19, 2021Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
clementbernardd / Count-Based-Exploration
View on GitHub
Our version of #Exploration: A Study of Count-Based Explorationfor Deep Reinforcement Learning for a class project
☆17Apr 30, 2021Updated 5 years ago
Facebear-ljx / PROTO
View on GitHub
☆17May 25, 2023Updated 3 years ago
srsohn / shortest-path-rl
View on GitHub
A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"
☆13Jul 19, 2021Updated 5 years ago
apexrl / Batch-Offline--RL-Paper-Lists
View on GitHub
Paper Collection for Batch RL with brief introductions.
☆85Feb 26, 2022Updated 4 years ago
clvrai / skill-chaining
View on GitHub
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)
☆38May 3, 2022Updated 4 years ago
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 2 years ago
agoldst / scuro
View on GitHub
R markdown format and template for light-on-dark beamer presentations—with fussy extras.
☆12Nov 1, 2021Updated 4 years ago
SwapnilPande / MOReL
View on GitHub
Model-Based Offline Reinforcement Learning
☆51Jan 13, 2021Updated 5 years ago
JuliaPOMDP / AdaOPS.jl
View on GitHub
An implementation of the AdaOPS (Adaptive Online Packing-based Search), which is an online POMDP Solver used to solve problems defined wi…
☆16Nov 16, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
secury / optidice
View on GitHub
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆16Aug 3, 2023Updated 2 years ago
hnjia00 / Delayed-Feedback
View on GitHub
☆10Jul 8, 2021Updated 5 years ago
tianheyu927 / mopo
View on GitHub
Code for MOPO: Model-based Offline Policy Optimization
☆191May 17, 2022Updated 4 years ago
hanjuku-kaso / awesome-offline-rl
View on GitHub
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,072May 23, 2024Updated 2 years ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
russellmendonca / mier_public
View on GitHub
☆13Mar 16, 2023Updated 3 years ago
zhuxingyu2021 / tinykv
View on GitHub
an implementation of a lsmtree
☆11Feb 18, 2023Updated 3 years ago
unrealcv / playground
View on GitHub
A minimal Unreal Engine project for developing and testing UnrealCV
☆17Nov 8, 2018Updated 7 years ago
kschweig / OfflineRL
View on GitHub
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
☆26Jan 16, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
JunjieWang95 / attention-based-lane-changing
View on GitHub
☆12Mar 15, 2022Updated 4 years ago
architsharma97 / earl_benchmark
View on GitHub
EARL: Environment for Autonomous Reinforcement Learning
☆37Nov 24, 2022Updated 3 years ago
olivierjeunen / pessimism-recsys-2021
View on GitHub
Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.
☆11Dec 15, 2022Updated 3 years ago
rasoolfa / P3O
View on GitHub
P3O paper code
☆30Aug 7, 2019Updated 6 years ago
sgiguere / RobinHood-NeurIPS-2019
View on GitHub
Implementation of safe offline bandit algorithms.
☆10Oct 27, 2019Updated 6 years ago
young-geng / JaxCQL
View on GitHub
Conservative Q learning in Jax
☆58Feb 7, 2023Updated 3 years ago
uncharted-technologies / risk-and-uncertainty
View on GitHub
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago