Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for outer-value-function-meta-rl
Users that are interested in outer-value-function-meta-rl are comparing it to the libraries listed below
Sorting:
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆61Oct 23, 2023Updated 2 years ago
- A library for deploying App on deepchain.bio☆31Sep 24, 2021Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Population-Based Reinforcement Learning for Combinatorial Optimization☆87Feb 12, 2024Updated 2 years ago
- Free collection of Bio datasets and embeddings☆35Oct 10, 2022Updated 3 years ago
- A toolkit for practical Human-AI cooperation research☆14Apr 19, 2024Updated last year
- ☆14Feb 24, 2025Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆56Jan 20, 2023Updated 3 years ago
- 存储在学习人工智能(AI)中涉及到的各种基础知识,工具,模型,算法,代码等。☆14Mar 10, 2019Updated 7 years ago
- VQ-VAE implementation in pytorch, supporting EMA and Gumbel trainings. Applicable for images and time series.☆11Oct 19, 2022Updated 3 years ago
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning☆11Jul 14, 2022Updated 3 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- 🐈⬛ Contextual bandits library for continuous action trees with smoothing in JAX☆71Oct 7, 2022Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.☆83Jan 26, 2026Updated last month
- 🧬 ManyFold: An efficient and flexible library for training and validating protein folding models☆81Dec 14, 2022Updated 3 years ago
- My submission to the ARC-AGI-3 Developer Preview Agent Compitition.☆43Jan 27, 2026Updated last month
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆35Jun 28, 2024Updated last year
- ESM2 protein language models in JAX/Flax☆18Oct 10, 2022Updated 3 years ago
- A python tool that generate latex(e.g. Table, matrix) code.☆10Jun 22, 2022Updated 3 years ago
- A collection of matrix games in JAX☆13Nov 28, 2024Updated last year
- ☆12Dec 13, 2023Updated 2 years ago
- Input files and results of paper: Phase equilibrium of liquid water and hexagonal from ice enhanced sampling molecular dynamics simulatio…☆10Apr 9, 2021Updated 4 years ago
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- A reinforcement learning agent playing as the turret, where its goal is to allow ten friendly units to enter the base, and loses if an en…☆14Dec 24, 2020Updated 5 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- CovidCode UI is a web application that allows physicians to generate authorization code. Patient can then submit his seed secret key in t…☆11Jul 19, 2023Updated 2 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆274Sep 22, 2025Updated 5 months ago
- To identify features by aggregate-label learning in spiking neurons☆14Feb 19, 2018Updated 8 years ago
- Differentiable Markov Chain Monte Carlo☆15Mar 23, 2024Updated last year
- Flow Annealed Importance Sampling Bootstrap (FAB) with JAX.☆13Jun 12, 2024Updated last year
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Jul 5, 2023Updated 2 years ago
- AI/ML/DS conference/workshop/event deadlines on the African continent☆23Updated this week
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆23Feb 15, 2025Updated last year
- Minimal yet performant LLM examples in pure JAX☆244Jan 14, 2026Updated 2 months ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆32Mar 9, 2026Updated last week
- Set of tools to generate a multi-eGO force field to perform molecular dynamics simulations☆15Mar 11, 2026Updated last week
- RND1: Scaling Diffusion Language Models☆176Feb 22, 2026Updated 3 weeks ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year