jsikyoon/V-MPO_torch

V-MPO torch version with DMLab30 and GTrXL

☆13

Alternatives and similar repositories for V-MPO_torch

Users who are interested in V-MPO_torch are comparing it to the libraries listed below.

YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48 · Nov 10, 2020 · Updated 5 years ago
RodkinIvan / Transformer-RL
Transformers (GTrXL & CoBERL) applied to RL tasks
☆29 · Aug 18, 2022 · Updated 3 years ago
alantess / gtrxl-torch
Gated Transformer Model for Computer Vision
☆25 · Jul 11, 2021 · Updated 5 years ago
wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16 · Oct 23, 2021 · Updated 4 years ago
JimZhouZZY / RNaD-JunQi
Junqi (Army Chess) AI based on Regularized Nash Dynamics
☆14 · Apr 2, 2025 · Updated last year
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
jerrodparker20 / adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
☆136 · May 11, 2020 · Updated 6 years ago
pranavAL / DART
Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024
☆11 · Jun 6, 2024 · Updated 2 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆30 · Dec 14, 2021 · Updated 4 years ago
junmokane / spatially-aware-transformer
☆10 · Dec 10, 2024 · Updated last year
IouJenLiu / HTS-RL
☆21 · Dec 22, 2020 · Updated 5 years ago
jsikyoon / OCRL
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12 · Feb 23, 2024 · Updated 2 years ago
martius-lab / GateL0RD-paper
Code for the paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
☆11 · Nov 12, 2021 · Updated 4 years ago
feidieufo / homework
Assignments for CS294-112.
☆30 · Sep 11, 2019 · Updated 6 years ago
XiaojuanTang / Mars
A benchmark for evaluating situated inductive reasoning
☆16 · Jan 7, 2025 · Updated last year
teslacool / m-curl
M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
☆29 · Nov 5, 2020 · Updated 5 years ago
kaloureyes3 / v4-clients
☆10 · Apr 5, 2024 · Updated 2 years ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆183 · Feb 21, 2023 · Updated 3 years ago
sail-sg / optim4rl
Optim4RL is a Jax framework of learning to optimize for reinforcement learning.
☆28 · Nov 27, 2024 · Updated last year
vlad17 / mve
MVE: model-based value estimation
☆11 · Jul 30, 2018 · Updated 7 years ago
xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32 · Nov 22, 2025 · Updated 8 months ago
MAS-anony / ASN
☆34 · Dec 8, 2022 · Updated 3 years ago
Gouet / QMIX-Starcraft
☆17 · Dec 4, 2019 · Updated 6 years ago
kvfrans / rlbase_stable
☆46 · Jul 12, 2024 · Updated 2 years ago
ademiadeniji / irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆42 · Jan 13, 2024 · Updated 2 years ago
kethan-1818 / 5G-channel-modulation-using-RL
I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforc…
☆14 · Mar 27, 2024 · Updated 2 years ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆180 · Jun 23, 2023 · Updated 3 years ago
Mehooz / BIRD_code
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14 · May 23, 2021 · Updated 5 years ago
dbcbtc / RL-Papers
Papers about reinforcement learning
☆13 · Jan 4, 2021 · Updated 5 years ago
xunger99 / SAAC-StarCraft-Adversary-Agent-Challenge
☆12 · Aug 24, 2021 · Updated 4 years ago
mikelma / componet
Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)
☆29 · Jul 20, 2024 · Updated 2 years ago
IouJenLiu / CMAE
☆50 · Jul 23, 2021 · Updated 5 years ago
spandanagella / verse
Visual Verb Sense Disambiguation
☆13 · Apr 26, 2019 · Updated 7 years ago
arseniycodes / minirt
🔦 A minimal raytracing engine written in C on MinilibX
☆10 · Mar 23, 2021 · Updated 5 years ago
daisatojp / mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆84 · Nov 19, 2022 · Updated 3 years ago
AiltonOliveir / RL-env-for-communications
Reinforcement learning environment for MIMO communications.
☆15 · Jul 2, 2021 · Updated 5 years ago
semitable / seps
Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)
☆10 · Oct 26, 2021 · Updated 4 years ago
timoklein / redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
☆34 · Oct 22, 2024 · Updated last year
BaoqianWang / IROS22_DARL1N
☆14 · Jul 27, 2022 · Updated 3 years ago