instadeepai/outer-value-function-meta-rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/instadeepai/outer-value-function-meta-rl)

instadeepai / outer-value-function-meta-rl

Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

☆13

Alternatives and similar repositories for outer-value-function-meta-rl

Users that are interested in outer-value-function-meta-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

instadeepai / sebulba
View on GitHub
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆61Oct 23, 2023Updated 2 years ago
DeepChainBio / deepchain-apps
View on GitHub
A library for deploying App on deepchain.bio
☆31Sep 24, 2021Updated 4 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
instadeepai / poppy
View on GitHub
Population-Based Reinforcement Learning for Combinatorial Optimization
☆87Feb 12, 2024Updated 2 years ago
DeepChainBio / bio-datasets
View on GitHub
Free collection of Bio datasets and embeddings
☆35Oct 10, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
instadeepai / DEgym
View on GitHub
A LLM-friendly framework for translating dynamical equations to gymnasium-compatible RL environments.
☆33Mar 18, 2026Updated 4 months ago
cogment / cogment-lab
View on GitHub
A toolkit for practical Human-AI cooperation research
☆14Apr 19, 2024Updated 2 years ago
hca-neurips2019 / hca
View on GitHub
Algorithms described in the paper Hindsight Credit Assignment (NeurIPS 2019).
☆11Oct 27, 2019Updated 6 years ago
instadeepai / awesome-marl
View on GitHub
A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers
☆58Jan 20, 2023Updated 3 years ago
irhum / esmjax
View on GitHub
ESM2 protein language models in JAX/Flax
☆19Oct 10, 2022Updated 3 years ago
niudong1001 / learn-ai
View on GitHub
存储在学习人工智能（AI）中涉及到的各种基础知识，工具，模型，算法，代码等。
☆14Mar 10, 2019Updated 7 years ago
maxjcohen / vqvae
View on GitHub
VQ-VAE implementation in pytorch, supporting EMA and Gumbel trainings. Applicable for images and time series.
☆11Oct 19, 2022Updated 3 years ago
instadeepai / catx
View on GitHub
🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX
☆71Oct 7, 2022Updated 3 years ago
instadeepai / marl-eval
View on GitHub
A tool for aggregating and plotting MARL experiment data.
☆86Apr 13, 2026Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
instadeepai / manyfold
View on GitHub
🧬 ManyFold: An efficient and flexible library for training and validating protein folding models
☆80Dec 14, 2022Updated 3 years ago
EmptyJackson / groove
View on GitHub
Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…
☆36Jun 28, 2024Updated 2 years ago
HarryXD2018 / PyTex
View on GitHub
A python tool that generate latex(e.g. Table, matrix) code.
☆10Jun 22, 2022Updated 4 years ago
hr0nix / dejax
View on GitHub
Accelerated replay buffers in JAX
☆46Sep 17, 2022Updated 3 years ago
noegroup / rigid-flows
View on GitHub
☆12Dec 13, 2023Updated 2 years ago
instadeepai / matrax
View on GitHub
A collection of matrix games in JAX
☆14Apr 13, 2026Updated 3 months ago
alecwangcq / f-divergence-dpo
View on GitHub
Direct preference optimization with f-divergences.
☆17Nov 3, 2024Updated last year
PabloPiaggi / Crystallization-of-IceIh
View on GitHub
Input files and results of paper: Phase equilibrium of liquid water and hexagonal from ice enhanced sampling molecular dynamics simulatio…
☆10Apr 9, 2021Updated 5 years ago
irom-princeton / PAC-Imitation
View on GitHub
Code for Generalization Guarantees for (Multi-Modal) Imitation Learning
☆11Jul 14, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
jinglescode / unity-ml-agents-turret-defense
View on GitHub
A reinforcement learning agent playing as the turret, where its goal is to allow ten friendly units to enter the base, and loses if an en…
☆14Dec 24, 2020Updated 5 years ago
jiasenlu / vit-vqgan-jax
View on GitHub
Jax implementation of VIT-VQGAN
☆10Jan 25, 2024Updated 2 years ago
admin-ch / CovidCode-UI
View on GitHub
CovidCode UI is a web application that allows physicians to generate authorization code. Patient can then submit his seed secret key in t…
☆10Jul 19, 2023Updated 3 years ago
instadeepai / flashbax
View on GitHub
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆278Sep 22, 2025Updated 9 months ago
xmu-rl-3dv / DoF
View on GitHub
☆18Feb 24, 2025Updated last year
mackelab / multispike_tempotron
View on GitHub
To identify features by aggregate-label learning in spiking neurons
☆14Feb 19, 2018Updated 8 years ago
choderalab / chiron
View on GitHub
Differentiable Markov Chain Monte Carlo
☆15Mar 23, 2024Updated 2 years ago
guytenn / Act2Vec
View on GitHub
☆13May 10, 2019Updated 7 years ago
lollcat / fab-jax
View on GitHub
Flow Annealed Importance Sampling Bootstrap (FAB) with JAX.
☆13Jun 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dsfsi / deadlines
View on GitHub
AI/ML/DS conference/workshop/event deadlines on the African continent
☆25Updated this week
multi-ego / multi-eGO
View on GitHub
Set of tools to generate a multi-eGO force field to perform molecular dynamics simulations
☆16Updated this week
clement-bonnet / lpn
View on GitHub
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆112Nov 25, 2025Updated 7 months ago
sash-a / CleanRL.jl
View on GitHub
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆24Feb 15, 2025Updated last year
brownirl / lambda_discrepancy
View on GitHub
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆24Oct 28, 2024Updated last year
google-research / reincarnating_rl
View on GitHub
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆100Jul 5, 2023Updated 3 years ago
seoklab / GalaxyWater-CNN
View on GitHub
3D-CNN based water position prediction method
☆11Nov 20, 2023Updated 2 years ago