Neo-X/SMiRL_Code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Neo-X/SMiRL_Code)

Neo-X / SMiRL_Code

☆20

Alternatives and similar repositories for SMiRL_Code

Users that are interested in SMiRL_Code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ArnaudFickinger / adversarial-surprise
View on GitHub
Explore and Control with Adversarial Surprise
☆10Jul 20, 2021Updated 5 years ago
karush17 / emix
View on GitHub
Energy-based Surprise Minimization for Multi-Agent Value Factorization
☆12Oct 20, 2023Updated 2 years ago
Dawn0523 / LAIES
View on GitHub
☆18Jul 14, 2023Updated 3 years ago
LXXXXR / ICES
View on GitHub
[ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…
☆25May 29, 2024Updated 2 years ago
roger-creus / ale-nl
View on GitHub
A framework for evaluating LLMs in Atari games
☆15Apr 21, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uoe-agents / CMID
View on GitHub
☆13Apr 25, 2024Updated 2 years ago
milarobotlearningcourse / mini_crossformer
View on GitHub
☆16Aug 15, 2025Updated 11 months ago
LeapLabTHU / MOSS
View on GitHub
Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning
☆23Nov 16, 2022Updated 3 years ago
yuchen-x / MacroMARL
View on GitHub
☆26Apr 16, 2024Updated 2 years ago
anniesch / single-life-rl
View on GitHub
Single-Life Reinforcement Learning
☆14Dec 17, 2022Updated 3 years ago
aijunbai / thompson-sampling
View on GitHub
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Jun 20, 2016Updated 10 years ago
renwang435 / pgr
View on GitHub
Prioritized Generative Replay (ICLR 2025 Oral)
☆29Mar 1, 2025Updated last year
holarissun / RewardShifting
View on GitHub
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
baimingc / delay-aware-MBRL
View on GitHub
Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".
☆29Feb 8, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhixuan-lin / G-SWM
View on GitHub
Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"
☆36Dec 8, 2022Updated 3 years ago
young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
Bottle101 / aerial_autonomy_development_environment
View on GitHub
Leveraging system development and robot deployment for aerial autonomous navigation.
☆11Feb 23, 2026Updated 4 months ago
vickipedia6 / Tennis-Deep-Reinforcement-Learning
View on GitHub
Training Multiple agents in the same environment to collaborate and compete with each other
☆12Dec 1, 2019Updated 6 years ago
vivekmyers / empowerment_successor_representations
View on GitHub
Code for the paper "Learning to Assist Humans without Inferring Rewards"
☆20Jul 7, 2024Updated 2 years ago
kikojay / EMC
View on GitHub
The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.
☆40Feb 16, 2023Updated 3 years ago
lasgroup / aceirl
View on GitHub
Implementation of "Active Exploration for Inverse Reinforcement Learning (AceIRL), NeurIPS 2022.
☆14Oct 12, 2022Updated 3 years ago
acyclics / MPO
View on GitHub
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆29Sep 10, 2020Updated 5 years ago
GGchen1997 / BDI
View on GitHub
This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…
☆14Jan 19, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
deligentfool / COLA
View on GitHub
Codes for the paper "Consensus Learning for Cooperative Multi-Agent Reinforcement Learning"
☆18Aug 15, 2022Updated 3 years ago
tmoer / a0c
View on GitHub
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
Div-Infinity / XQL
View on GitHub
Extreme Q-Learning: Max Entropy RL without Entropy
☆88Feb 14, 2023Updated 3 years ago
mgerstgrasser / super
View on GitHub
suPER is a collaborative multi-agent RL algorithm
☆14Jun 11, 2024Updated 2 years ago
ziyadsheeba / qfat
View on GitHub
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆11Mar 3, 2026Updated 4 months ago
marcbrittain / Prioritized-Sequence-Experience-Replay
View on GitHub
Prioritized Sequence Experience Replay
☆10Aug 16, 2021Updated 4 years ago
WeihaoTan / gym-macro-overcooked
View on GitHub
☆16May 11, 2023Updated 3 years ago
ml-jku / reactive-exploration
View on GitHub
Code for the paper "Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning"
☆16Jul 4, 2022Updated 4 years ago
maximilianigl / rl-iter
View on GitHub
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆11Jun 8, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
owencqueen / RL_Final_Project
View on GitHub
"Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.
☆13Dec 8, 2021Updated 4 years ago
MartinQingM / Intersection-simulation
View on GitHub
Build a very simple intersection assistant driving system simulation based on SUMO and TraCI4Matlab
☆10Feb 24, 2019Updated 7 years ago
DrZero0 / MACC
View on GitHub
The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".
☆18May 1, 2022Updated 4 years ago
projectaligned / chchanges
View on GitHub
Bayesian Online Changepoint Detection
☆13Aug 31, 2020Updated 5 years ago
sihyun-yu / RoMA
View on GitHub
[NeurIPS'21] RoMA: Robust Model Adaptation for Offline Model-based Optimization
☆15Oct 28, 2021Updated 4 years ago
d5rlbenchmark / d5rl
View on GitHub
☆31Oct 3, 2023Updated 2 years ago
facebookresearch / icp-block-mdp
View on GitHub
Invariant Causal Prediction for Block MDPs
☆44Jun 11, 2020Updated 6 years ago