znowu/mirror-learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/znowu/mirror-learning)

znowu / mirror-learning

The code for experiments conducted to verify the correctness of mirror learning.

☆11

Alternatives and similar repositories for mirror-learning

Users that are interested in mirror-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

luchris429 / discovered-policy-optimisation
View on GitHub
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12Jun 15, 2023Updated 3 years ago
louiskirsch / vsml-neurips2021
View on GitHub
Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905
☆33Jan 9, 2022Updated 4 years ago
menglinjian / Deep-FTRL-ORW
View on GitHub
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11Dec 1, 2022Updated 3 years ago
Stanford-ILIAD / Conventions-ModularPolicy
View on GitHub
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆15Mar 9, 2021Updated 5 years ago
smonsays / metax
View on GitHub
flexible meta-learning in jax
☆16Oct 19, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
vecna-labs / open-trajectory-gym
View on GitHub
Agentic RL post-training framework
☆30May 28, 2026Updated last month
Tyan66666 / WanSync
View on GitHub
WanSync(顽爪爪同步) 一款轻量级 Flutter 应用，可自动从顽鹿（OneLap）下载 FIT 运动文件并批量同步到 Strava、行者、Intervals.icu 和 Outbase。支持通过系统分享直接导入 FIT 文件上传，提供 API / 网页双模式上传，…
☆20Jul 15, 2026Updated last week
gcucurull / jax-gat
View on GitHub
JAX implementation of Graph Attention Networks
☆13Jan 29, 2022Updated 4 years ago
automl / hypersweeper
View on GitHub
Hydra sweeper integration of our favorite optimization packages, utilizing ask-and-tell interfaces.
☆16Nov 14, 2025Updated 8 months ago
finlayiainmaclean / rdsl
View on GitHub
Selection language for RDKit
☆17Mar 20, 2026Updated 4 months ago
instadeepai / matrax
View on GitHub
A collection of matrix games in JAX
☆14Apr 13, 2026Updated 3 months ago
manantomar / Mirror-Descent-Policy-Optimization
View on GitHub
Mirror Descent Policy Optimization
☆43Oct 31, 2020Updated 5 years ago
ElisevanderPol / mdp-homomorphic-networks
View on GitHub
☆32Feb 20, 2021Updated 5 years ago
niki-amini-naieni / CounTX
View on GitHub
Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.
☆42Sep 27, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vpj / jax_transformer
View on GitHub
Autoregressive transformer in JAX from scratch
☆23Jan 28, 2022Updated 4 years ago
tristandeleu / jax-meta-learning
View on GitHub
A collection of meta-learning algorithms in Jax
☆24Sep 3, 2022Updated 3 years ago
hcmlab / GANterfactual-RL
View on GitHub
Counterfactual explanations for Reinforcement Learning agents on Atari
☆12Apr 3, 2023Updated 3 years ago
epigramai / tfserving-simple-example
View on GitHub
How to save a model for tfserving
☆11Jan 13, 2018Updated 8 years ago
iBims1JFK / gait_in_eight
View on GitHub
This repository contains the code base for the paper "Gait in Eight: Efficient On-Robot Learning for Omnidirectional Quadruped Locomotion…
☆14Mar 8, 2025Updated last year
google / putting-dune
View on GitHub
☆10Feb 20, 2024Updated 2 years ago
hamishs / JAX-RL
View on GitHub
JAX implementations of various deep reinforcement learning algorithms.
☆25Feb 2, 2025Updated last year
zecevic-matej / ESSAI-2023-Causality
View on GitHub
European Summer School on AI Course "Machines Climbing Pearl's Ladder of Causation"
☆13Feb 20, 2024Updated 2 years ago
mueller-mp / maha-norm
View on GitHub
☆16May 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
abhayraw1 / planet-torch
View on GitHub
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆13Aug 31, 2020Updated 5 years ago
ElisevanderPol / symmetrizer
View on GitHub
☆32Feb 21, 2021Updated 5 years ago
luchris429 / popjaxrl
View on GitHub
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆116Dec 5, 2023Updated 2 years ago
sash-a / CleanRL.jl
View on GitHub
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆24Feb 15, 2025Updated last year
epignatelli / reinforcement-learning-an-introduction
View on GitHub
A python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R.S. Sutton and A. G. Barto.
☆21Jul 13, 2020Updated 6 years ago
notmahi / disk
View on GitHub
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21Mar 22, 2022Updated 4 years ago
VArdulov / ToMNet
View on GitHub
Reimplementation of ToMNet with some extensions for RL as well
☆14Apr 28, 2018Updated 8 years ago
mengzili / jemdoc-python3
View on GitHub
An unofficial Python 3 version of jemdoc.
☆12Feb 8, 2026Updated 5 months ago
epignatelli / discovering-reinforcement-learning-algorithms
View on GitHub
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆23Dec 22, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ben-eysenbach / info_geometry
View on GitHub
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Oct 6, 2021Updated 4 years ago
flowersteam / teachDeepRL
View on GitHub
☆93Jun 8, 2021Updated 5 years ago
jellylee / javawebEstore
View on GitHub
用jsp+servlet实现的网上购物系统，包含用户权限控制。
☆12May 18, 2017Updated 9 years ago
Michael-Beukman / RobocupGym
View on GitHub
Reinforcement Learning inside a 3D soccer simulation
☆37Sep 15, 2024Updated last year
world-modelz / world-modelz
View on GitHub
video prediction and world model research
☆14Jun 10, 2022Updated 4 years ago
vasily789 / adaptive-weighted-gans
View on GitHub
☆13Jan 13, 2022Updated 4 years ago
dsblank / jupyter.brynmawr
View on GitHub
Templates and custom config for Bryn Mawr College Physics Jupyterhub server
☆12Jun 5, 2017Updated 9 years ago