typoverflow/UtilsRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/typoverflow/UtilsRL)

typoverflow / UtilsRL

A python module designed for agile RL algorithm developing.

☆26

Alternatives and similar repositories for UtilsRL

Users that are interested in UtilsRL are comparing it to the libraries listed below

Sorting:

FanmingL / SmartLogger
View on GitHub
☆12May 14, 2024Updated last year
LAMDA-RL / OfflineRL-Lib
View on GitHub
Benchmarked implementations of Offline RL Algorithms.
☆77Mar 4, 2025Updated 11 months ago
DrZero0 / MACC
View on GitHub
The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".
☆18May 1, 2022Updated 3 years ago
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated 11 months ago
liyc-ai / RL-pytorch
View on GitHub
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
☆26Jan 27, 2026Updated last month
x35f / unstable_baselines
View on GitHub
Re-implementations of SOTA RL algorithms.
☆136Sep 7, 2023Updated 2 years ago
lamda-bbo / madac
View on GitHub
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26Mar 6, 2023Updated 2 years ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
polixir / d3pe
View on GitHub
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
☆11Jun 2, 2022Updated 3 years ago
AIDefender / MyDiscor
View on GitHub
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14May 24, 2021Updated 4 years ago
xionghuichen / RLAssistant
View on GitHub
RLA is a tool for managing your RL experiments automatically
☆72Feb 7, 2023Updated 3 years ago
xionghuichen / MAPLE
View on GitHub
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
☆25Jan 16, 2024Updated 2 years ago
jiangsy / LAMDA-Beamer-Template
View on GitHub
A beamer template for LAMDA lab at NJU
☆16Oct 17, 2020Updated 5 years ago
rraileanu / policy-dynamics-value-functions
View on GitHub
☆33Aug 30, 2024Updated last year
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
typoverflow / .dotfiles
View on GitHub
A repo containing bash scripts to deploy reinforcement learning dev environment within one click!
☆10May 15, 2025Updated 9 months ago
mansicer / MAIC
View on GitHub
The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".
☆63Dec 12, 2023Updated 2 years ago
mansicer / self-verification
View on GitHub
☆17Dec 23, 2025Updated 2 months ago
tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
0xWelt / BibTeX-Formatter
View on GitHub
Format your bibtex (.bib) file to help standardize citations for conference and journal submissions
☆14Nov 23, 2025Updated 3 months ago
microsoft / MAMBA
View on GitHub
Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
apexrl / autombpo
View on GitHub
Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>
☆13Nov 16, 2021Updated 4 years ago
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated last year
zzq-bot / offline-marl-framework-offpymarl
View on GitHub
Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.
☆35Oct 7, 2024Updated last year
sail-sg / PatchAIL
View on GitHub
Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>
☆14Feb 15, 2023Updated 3 years ago
LAMDA-RL / ACT
View on GitHub
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17Feb 10, 2024Updated 2 years ago
chenf-ai / Multi-Agent-Communication-Considering-Representation-Learning
View on GitHub
☆30Dec 22, 2022Updated 3 years ago
typoverflow / flow-rl
View on GitHub
Flow RL is a high-performance RL library with flow and diffusion models.
☆28Updated this week
typoverflow / Pirror
View on GitHub
基于树莓派（Pi）和PyGame的魔镜（Mirror）
☆18Aug 5, 2022Updated 3 years ago
jiangsy / slbo_pytorch
View on GitHub
☆15Sep 14, 2020Updated 5 years ago
liuxhym / EDIS
View on GitHub
EDIS: Energy-guided DIffusion Sampling
☆18Aug 10, 2024Updated last year
polixir / NeoRL2
View on GitHub
☆19Oct 27, 2025Updated 4 months ago
dennisl88 / rand_param_envs
View on GitHub
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
☆20Feb 14, 2019Updated 7 years ago
danielshin1 / oprl
View on GitHub
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Dec 30, 2022Updated 3 years ago
yunke-wang / WGAIL
View on GitHub
[ICML 2021] Learning to Weight Imperfect Demonstrations
☆20Nov 4, 2022Updated 3 years ago
FanmingL / ESCP
View on GitHub
Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy
☆20Jun 1, 2022Updated 3 years ago
OffDynamicsRL / off-dynamics-rl
View on GitHub
☆63Jan 30, 2026Updated last month
polixir / OfflineRL
View on GitHub
A collection of offline reinforcement learning algorithms.
☆208Nov 26, 2024Updated last year
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago