karush17/emix

Energy-based Surprise Minimization for Multi-Agent Value Factorization

☆12

Alternatives and similar repositories for emix

Users that are interested in emix are comparing it to the libraries listed below.

karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42 · Jul 25, 2024 · Updated 2 years ago
karush17 / Evolution-Strategies-PyTorch
Implementation of OpenAI's Evolution Strategies in PyTorch.
☆20 · Apr 22, 2020 · Updated 6 years ago
karush17 / Deep-Eligibility-Traces
Implementation of Eligibility Traces with Neural Networks in PyTorch and Tensorflow 2.0
☆26 · Sep 10, 2021 · Updated 4 years ago
karush17 / Hierarchical-Attention-Reinforcement-Learning
Hierarchical Attention in Reinforcement Learning for Stock Order Executions
☆33 · Apr 7, 2021 · Updated 5 years ago
mgerstgrasser / super
suPER is a collaborative multi-agent RL algorithm
☆14 · Jun 11, 2024 · Updated 2 years ago
Neo-X / SMiRL_Code
☆20 · Nov 13, 2022 · Updated 3 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago
saizhang0218 / VBC
PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"
☆54 · Dec 8, 2022 · Updated 3 years ago
simsimiSION / pymarl-algorithm-extension-via-starcraft
☆13 · Aug 15, 2020 · Updated 5 years ago
vitchyr / torch-rl
A reinforcement learning package implemented in Torch
☆11 · Jan 24, 2016 · Updated 10 years ago
yuchen-x / MacroMARL
☆26 · Apr 16, 2024 · Updated 2 years ago
wendelinboehmer / dcg
☆77 · Jun 2, 2024 · Updated 2 years ago
KornbergFresnel / CommNet
An implementation of CommNet
☆35 · Nov 14, 2017 · Updated 8 years ago
vickipedia6 / Tennis-Deep-Reinforcement-Learning
Training multiple agents in the same environment to collaborate and compete with each other
☆12 · Dec 1, 2019 · Updated 6 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆34 · Dec 1, 2019 · Updated 6 years ago
nshepperd / gumbel-rao-pytorch
☆11 · Jul 25, 2021 · Updated 5 years ago
saizhang0218 / TMC
PyTorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"
☆27 · Dec 6, 2020 · Updated 5 years ago
wjh720 / QPLEX
☆105 · Nov 13, 2020 · Updated 5 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40 · Jul 18, 2025 · Updated last year
ztjhz / t5-jax
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆24 · Jun 10, 2023 · Updated 3 years ago
yardenas / jax-dreamer
Dreamer on JAX
☆16 · Jan 19, 2022 · Updated 4 years ago
lywang3081 / MRDC
Memory Replay with Data Compression (ICLR 2022)
☆16 · Sep 26, 2023 · Updated 2 years ago
pokaxpoka / rad_procgen
RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)
☆19 · Mar 29, 2021 · Updated 5 years ago
milarobotlearningcourse / mini_crossformer
☆16 · Aug 15, 2025 · Updated 11 months ago
agakshat / maddpg
Implementation of Multi-Agent Deep Deterministic Policy Gradients
☆39 · Mar 28, 2018 · Updated 8 years ago
instadeepai / qd-skill-discovery-benchmark
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
☆17 · Apr 2, 2026 · Updated 3 months ago
mahaozhe / ReLara
[ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara)
☆17 · Aug 2, 2024 · Updated last year
rh01 / deeprm
Deep reinforcement learning for resource management and job scheduling. It is inspired by deeprm model and I will implement for in practica…
☆12 · Jun 14, 2019 · Updated 7 years ago
chanind / linear-relational
Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch
☆11 · Aug 7, 2024 · Updated last year
QDPP-GitHub / QDPP
Multi-Agent Determinantal Q-Learning
☆43 · Nov 22, 2022 · Updated 3 years ago
andylolu2 / jax-diffusion
Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax.
☆22 · Oct 12, 2023 · Updated 2 years ago
BrightFeather / deeprm_conv
Based on Hongzi Mao's works of deeprm: https://github.com/hongzimao/deeprm
☆12 · Jun 11, 2017 · Updated 9 years ago
tristandeleu / jax-meta-learning
A collection of meta-learning algorithms in JAX
☆24 · Sep 3, 2022 · Updated 3 years ago
Dawn0523 / LAIES
☆18 · Jul 14, 2023 · Updated 3 years ago
N3PDF / evolutionary_keras
An evolutionary algorithm implementation for Keras
☆10 · Jan 4, 2021 · Updated 5 years ago
jiechuanjiang / I2Q
I2Q: A Fully Decentralized Q-Learning Algorithm
☆19 · Nov 10, 2022 · Updated 3 years ago
akiani / rlsepsis234
CS234 Sepsis Simulator For RL
☆18 · Dec 8, 2022 · Updated 3 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 · Jul 18, 2023 · Updated 3 years ago
tencent-ailab / Arena
☆11 · Mar 10, 2021 · Updated 5 years ago