zdhNarsil/Stochastic-Marginal-Actor-Critic

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zdhNarsil/Stochastic-Marginal-Actor-Critic)

zdhNarsil / Stochastic-Marginal-Actor-Critic

Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".

☆24

Alternatives and similar repositories for Stochastic-Marginal-Actor-Critic

Users that are interested in Stochastic-Marginal-Actor-Critic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

boschresearch / ube-mbrl
View on GitHub
Model-Based Uncertainty in Value Functions (AISTATS2023)
☆16Feb 28, 2023Updated 3 years ago
baturaysaglam / actor-prioritized-exp-replay
View on GitHub
Actor Prioritized Experience Replay
☆19Nov 20, 2023Updated 2 years ago
thethaibinh / agile_flight
View on GitHub
Simulation system for path planning evaluation
☆13Dec 13, 2025Updated 7 months ago
Howuhh / streaming-drl-jax
View on GitHub
streaming deep reinforcement learning but 4x faster with jax!
☆19Jan 4, 2026Updated 6 months ago
uoe-agents / MATE
View on GitHub
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
☆15Apr 25, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
ALRhub / MTS3
View on GitHub
Implementation of Neurips 2023 Paper "Multi Time Scale World Models"
☆18Nov 8, 2024Updated last year
sweetice / PEER-CVPR23
View on GitHub
Authors' implementation of PEER
☆11Jul 13, 2023Updated 3 years ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
Michael-Beukman / RobocupGym
View on GitHub
Reinforcement Learning inside a 3D soccer simulation
☆37Sep 15, 2024Updated last year
zdhNarsil / Distributional-GFlowNets
View on GitHub
Code for our TMLR paper "Distributional GFlowNets with Quantile Flows".
☆13Feb 14, 2024Updated 2 years ago
uoe-agents / CMID
View on GitHub
☆13Apr 25, 2024Updated 2 years ago
franrruiz / uivi
View on GitHub
Code for Unbiased Implicit Variational Inference (UIVI)
☆15Jan 18, 2019Updated 7 years ago
newera-001 / motor-system
View on GitHub
A project copied from google-research which named motion-imitation was rewrited with PyTorch
☆10Sep 30, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
lollcat / Aspen-RL
View on GitHub
Reinforcement learning for chemical engineering process design with Aspen Simulator.
☆20Mar 8, 2023Updated 3 years ago
eilab-gt / NovGrid
View on GitHub
Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …
☆34May 21, 2024Updated 2 years ago
BruceGeLi / TCE_RL
View on GitHub
Temporally Correlated Episodic Reinforcement Learning, ICLR 24
☆12Apr 8, 2024Updated 2 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
clvrai / create
View on GitHub
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18Nov 22, 2022Updated 3 years ago
automl / arlbench
View on GitHub
HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient
☆32Jun 16, 2026Updated last month
abhayraw1 / planet-torch
View on GitHub
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆13Aug 31, 2020Updated 5 years ago
hu-po / pySACQ
View on GitHub
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆39Feb 13, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
symoon11 / dreamerv3-flax
View on GitHub
Flax Implementation of DreamerV3 on Crafter
☆18Nov 29, 2025Updated 7 months ago
google-deepmind / nao_top10
View on GitHub
☆19Mar 1, 2023Updated 3 years ago
kenjyoung / dreamerv2_JAX
View on GitHub
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18Jan 16, 2023Updated 3 years ago
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
stepelu / idbm-pytorch
View on GitHub
☆13Sep 13, 2023Updated 2 years ago
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
shwangtangjun / SVGD-PyTorch
View on GitHub
A PyTorch implementation of SVGD (Stein Variational Gradient Descent), contains all examples including bayesian inference in the paper
☆12Jul 30, 2020Updated 5 years ago
nmonette / NCC-UED
View on GitHub
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆17Nov 24, 2025Updated 8 months ago
subho406 / Recurrent-PPO-Jax
View on GitHub
Implementation of Proximal Policy Optimization in Jax+Flax
☆21May 18, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ben-eysenbach / mnm
View on GitHub
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆21Oct 6, 2021Updated 4 years ago
LihaoR / Entropy-Regularized-RL
View on GitHub
soft q learning and soft actor critic
☆16Dec 23, 2018Updated 7 years ago
facebookresearch / how-to-autorl
View on GitHub
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆88Nov 27, 2023Updated 2 years ago
facebookresearch / semi-discrete-flow
View on GitHub
code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation"
☆26Dec 10, 2022Updated 3 years ago
clvrai / new-actions-rl
View on GitHub
☆24Aug 9, 2024Updated last year
RajGhugare19 / stitching-is-combinatorial-generalisation
View on GitHub
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆25Apr 19, 2024Updated 2 years ago
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago