thanhnguyentang/mmdrl

Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354

☆30

Alternatives and similar repositories for mmdrl

Users who are interested in mmdrl are comparing it to the libraries listed below.

ThibautTheate / Unconstrained-Monotonic-Deep-Q-Network-algorithm
Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…
☆11 · Jun 3, 2022 · Updated 4 years ago
xtma / dsac
Distributional Soft Actor Critic
☆63 · Jun 6, 2020 · Updated 6 years ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 7 years ago
twni2016 / self-predictive-rl
Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024
☆27 · Apr 26, 2026 · Updated 3 months ago
qgallouedec / lge
☆33 · Mar 19, 2024 · Updated 2 years ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆142 · Feb 12, 2025 · Updated last year
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆191 · Jul 25, 2024 · Updated 2 years ago
deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆97 · Mar 25, 2021 · Updated 5 years ago
MichaelArbel / MMD-gradient-flow
☆12 · Jul 25, 2024 · Updated 2 years ago
facebookresearch / reward-estimator-corl
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23 · Oct 26, 2018 · Updated 7 years ago
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20 · Oct 6, 2021 · Updated 4 years ago
microsoft / FQF
FQF(Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for At…
☆48 · Sep 26, 2020 · Updated 5 years ago
JesseFarebro / distributional-sr
Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".
☆23 · Nov 8, 2024 · Updated last year
LeonardoBerti00 / Multi-Horizon-Forecasting-for-Limit-Order-Books
Pytorch implementation of DeepLOB-ATT and DeepLOB-Seq2Seq from Multi Horizon Forecasting for Limit Order Books
☆14 · Feb 4, 2023 · Updated 3 years ago
ido90 / CeSoR
☆19 · Nov 22, 2023 · Updated 2 years ago
Silvicek / cvar-algorithms
Risk-Averse Distributional Reinforcement Learning: Code
☆28 · Nov 25, 2018 · Updated 7 years ago
LucasAlegre / sac-plus
Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15 · Feb 21, 2021 · Updated 5 years ago
YiqinYang / VEM
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15 · Mar 9, 2022 · Updated 4 years ago
MatthieuSarkis / Portfolio-Optimization-and-Goal-Based-Investment-with-Reinforcement-Learning
☆20 · Mar 31, 2026 · Updated 3 months ago
BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆24 · Apr 7, 2021 · Updated 5 years ago
Fedex100 / awesome-compilers
☆18 · Jun 3, 2017 · Updated 9 years ago
shiwj16 / raa-drl
☆11 · Apr 20, 2021 · Updated 5 years ago
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld
☆14 · Jul 13, 2020 · Updated 6 years ago
NishanthVAnand / prediction-and-control-in-continual-reinforcement-learning
Code to reproduce results from the paper: Prediction and Control in Continual Reinforcement Learning, NeurIPS 2023.
☆13 · May 10, 2024 · Updated 2 years ago
LeonardoBerti00 / Data-Normalization-for-Bilinear-Structures-in-High-Frequency-Financial-Time-series-BiN-TABL
Pytorch implementation of BIN-TABL from Data Normalization for Bilinear Structures in HF Financial Time-series
☆14 · Aug 12, 2024 · Updated last year
rlai-lab / Regularized-GradientTD
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆38 · Oct 14, 2020 · Updated 5 years ago
Kchu / DeepRL_PyTorch
Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
☆217 · Mar 15, 2023 · Updated 3 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆58 · Nov 8, 2022 · Updated 3 years ago
yojul / cognac
Cooperative Graph-based Networked Agent Challenges for Multi-Agent Reinforcement Learning
☆15 · Jan 26, 2026 · Updated 6 months ago
ai-guild / convai
resources and documentation on convai challenge
☆17 · May 30, 2017 · Updated 9 years ago
awwang10 / sphinx
☆14 · Oct 23, 2025 · Updated 9 months ago
gcucurull / jax-gat
JAX implementation of Graph Attention Networks
☆13 · Jan 29, 2022 · Updated 4 years ago
google-research / reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆100 · Jul 5, 2023 · Updated 3 years ago
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆13 · Mar 13, 2022 · Updated 4 years ago
alexlioralexli / learned-fourier-features
Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
☆20 · Oct 2, 2022 · Updated 3 years ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)
☆27 · Jun 8, 2022 · Updated 4 years ago
JQVeenstra / arfima
Now updated prior to the version on CRAN.
☆15 · Jan 9, 2024 · Updated 2 years ago
markub3327 / rl-toolkit
RL-Toolkit: A Research Framework for Robotics
☆21 · Jan 22, 2026 · Updated 6 months ago
Sakura-Fire-Capital / DoubleEnsembleML
Blaze
☆17 · Jun 19, 2021 · Updated 5 years ago