brownirl/lambda_discrepancy

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

☆24

Alternatives and similar repositories for lambda_discrepancy

Users that are interested in lambda_discrepancy are comparing it to the libraries listed below.

taodav / pobax
Partially Observable Benchmarks in JAX
☆25 · Apr 30, 2026 · Updated 2 months ago
zombie-einstein / jaxpr-viz
Jaxpr Visualisation Tool
☆37 · Dec 22, 2024 · Updated last year
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated 2 years ago
aliang8 / varibad_jax
☆10 · Jun 27, 2024 · Updated 2 years ago
dunnolab / vintix
Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)
☆51 · May 23, 2025 · Updated last year
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18 · Jan 16, 2023 · Updated 3 years ago
google-deepmind / nao_top10
☆19 · Mar 1, 2023 · Updated 3 years ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32 · Oct 12, 2023 · Updated 2 years ago
Howuhh / streaming-drl-jax
streaming deep reinforcement learning but 4x faster with jax!
☆19 · Jan 4, 2026 · Updated 6 months ago
dunnolab / NinA
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17 · Sep 22, 2025 · Updated 9 months ago
nmonette / NCC-UED
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆17 · Nov 24, 2025 · Updated 7 months ago
corl-team / katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43 · Aug 22, 2023 · Updated 2 years ago
dunnolab / vintix-II
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner (ICLR 2026)
☆16 · Apr 8, 2026 · Updated 3 months ago
sash-a / CleanRL.jl
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆24 · Feb 15, 2025 · Updated last year
corl-team / ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35 · Sep 18, 2024 · Updated last year
tinkoff-ai / lb-sac
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21 · Feb 27, 2023 · Updated 3 years ago
amacrutherford / sampling-for-learnability
Official codebase for "Sampling For Learnability", published at NeurIPS 2024
☆24 · Oct 21, 2025 · Updated 8 months ago
riiswa / pointax
Pointax: PointMaze Environment for JAX
☆28 · Oct 22, 2025 · Updated 8 months ago
amidos2006 / pcg_benchmark
A benchmark to test and compare your pcg algorithm against each other
☆24 · Sep 22, 2025 · Updated 9 months ago
ini / multigrid
Fast and flexible multi-agent gridworld reinforcement learning environments.
☆50 · Mar 25, 2025 · Updated last year
machado-research / AgarCL
Agar.io for Continual Reinforcement Learning
☆24 · Jul 24, 2025 · Updated 11 months ago
JuliaPOMDP / AdaOPS.jl
An implementation of the AdaOPS (Adaptive Online Packing-based Search), which is an online POMDP Solver used to solve problems defined wi…
☆16 · Nov 16, 2025 · Updated 8 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆114 · Oct 27, 2024 · Updated last year
epignatelli / navix
Accelerated minigrid environments with JAX
☆175 · Oct 20, 2025 · Updated 9 months ago
google-deepmind / dmc_vision_benchmark
☆34 · Jun 21, 2024 · Updated 2 years ago
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆34 · Nov 28, 2025 · Updated 7 months ago
instadeepai / matrax
A collection of matrix games in JAX
☆14 · Apr 13, 2026 · Updated 3 months ago
robfiras / s2pg
Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
☆25 · May 5, 2024 · Updated 2 years ago
eilab-gt / NovGrid
Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …
☆34 · May 21, 2024 · Updated 2 years ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12 · Jun 15, 2023 · Updated 3 years ago
SpirinEgor / gulag
GULAG: GUessing LAnGuages with neural networks
☆13 · May 4, 2022 · Updated 4 years ago
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)
☆84 · Feb 13, 2025 · Updated last year
jsikyoon / OCRL
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12 · Feb 23, 2024 · Updated 2 years ago
radarFudan / mamba-minimal-jax
☆36 · Nov 22, 2024 · Updated last year
Michael-Beukman / RobocupGym
Reinforcement Learning inside a 3D soccer simulation
☆37 · Sep 15, 2024 · Updated last year
flowersteam / vivarium
Multi-agent simulator in Jax for research and teaching in AI & ALife
☆31 · Apr 11, 2026 · Updated 3 months ago
aadimator / JaxARC
A High-Throughput JAX-native Environment for Abstraction and Reasoning Research
☆16 · May 4, 2026 · Updated 2 months ago
desmond-ong / TAC-EA-model
Codebase for EA Modeling (for Transactions on Affective Computing paper)
☆12 · Dec 8, 2022 · Updated 3 years ago