corl-team/headless-ad

Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"

☆91

Alternatives and similar repositories for headless-ad

Users who are interested in headless-ad are comparing it to the repositories listed below.

corl-team / ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆34 · Sep 18, 2024 · Updated last year
dunnolab / vintix
Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)
☆45 · May 23, 2025 · Updated 9 months ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31 · Oct 12, 2023 · Updated 2 years ago
aliang8 / varibad_jax
☆10 · Jun 27, 2024 · Updated last year
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)
☆81 · Feb 13, 2025 · Updated last year
dunnolab / phi-module
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆17 · Jun 12, 2025 · Updated 8 months ago
corl-team / rebased
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
☆169 · Jan 16, 2025 · Updated last year
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆56 · Feb 3, 2023 · Updated 3 years ago
corl-team / katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43 · Aug 22, 2023 · Updated 2 years ago
jon--lee / decision-pretrained-transformer
Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆78 · May 28, 2024 · Updated last year
tinkoff-ai / lb-sac
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21 · Feb 27, 2023 · Updated 3 years ago
Dragon-Zhuang / Reinformer
Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL
☆46 · Oct 16, 2024 · Updated last year
brownirl / lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆23 · Oct 28, 2024 · Updated last year
ai-forever / fbc3_aij2023
☆22 · Oct 4, 2023 · Updated 2 years ago
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated last year
riiswa / pointax
Pointax: PointMaze Environment for JAX
☆26 · Oct 22, 2025 · Updated 4 months ago
tinkoff-ai / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆62 · Aug 3, 2023 · Updated 2 years ago
dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning
☆285 · Sep 8, 2025 · Updated 5 months ago
corl-team / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆627 · Feb 10, 2024 · Updated 2 years ago
VikhrModels / effective_llm_alignment
Effective LLM Alignment Toolkit
☆152 · Jun 25, 2025 · Updated 8 months ago
kvfrans / powderworld
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆73 · Aug 31, 2024 · Updated last year
sail-sg / ContinualBench
☆19 · May 20, 2025 · Updated 9 months ago
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆26 · Jan 14, 2025 · Updated last year
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆111 · Oct 27, 2024 · Updated last year
luchris429 / JaxLife
An Open-Ended Agentic Simulator
☆60 · Aug 11, 2024 · Updated last year
Artur-Galstyan / jaxonloader
A dataloader, but for JAX
☆20 · May 17, 2024 · Updated last year
robfiras / s2pg
Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
☆25 · May 5, 2024 · Updated last year
dunnolab / xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
☆325 · Dec 16, 2025 · Updated 2 months ago
hwang-ua / inac_pytorch
☆19 · Jun 25, 2023 · Updated 2 years ago
Max-Fu / icrt
[ICRA 2025] In-Context Imitation Learning via Next-Token Prediction
☆107 · Mar 17, 2025 · Updated 11 months ago
FusionBrainLab / LLM-Microscope
☆71 · Aug 27, 2024 · Updated last year
harryjo97 / riemannian-diffusion-mixture-torch
PyTorch implementation for "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).
☆13 · Jul 21, 2024 · Updated last year
tinkoff-ai / cnf
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12 · Jan 31, 2023 · Updated 3 years ago
pranavAL / DART
Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024
☆11 · Jun 6, 2024 · Updated last year
sai-prasanna / dreaming_of_many_worlds
☆25 · Sep 23, 2024 · Updated last year
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23 · Apr 19, 2024 · Updated last year
UT-Austin-RPL / amago
off-policy RL on long sequences
☆159 · Feb 17, 2026 · Updated 2 weeks ago
ahjwang / messenger-emma
Implements the Messenger environment and EMMA model.
☆25 · Jun 14, 2023 · Updated 2 years ago
nmonette / NCC-UED
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆17 · Nov 24, 2025 · Updated 3 months ago