jduquevan/advantage-alignment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jduquevan/advantage-alignment)

jduquevan / advantage-alignment

Advantage Alignment Algorithms (ICLR 2025 oral)

☆20

Alternatives and similar repositories for advantage-alignment

Users that are interested in advantage-alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cooperativex / SocialJax
View on GitHub
SocialJax: sequential social dilemma environments
☆90Jul 5, 2026Updated 3 weeks ago
pierrelux / rlbook
View on GitHub
A graduate-level introduction to reinforcement learning as a framework for modeling, optimization, and control, connecting dynamic models…
☆18Dec 9, 2025Updated 7 months ago
011235813 / lio
View on GitHub
Learning to Incentivize Other Learning Agents
☆36Jun 13, 2022Updated 4 years ago
minyoungpark1 / swin_transformer_v2_jax
View on GitHub
This project compares the performance of Swin-Transformer v2 implemented in JAX and PyTorch.
☆12Jun 8, 2022Updated 4 years ago
luchris429 / model-free-opponent-shaping
View on GitHub
Code for Model-Free Opponent Shaping (ICML 2022)
☆24Nov 18, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
lfochamon / csl
View on GitHub
csl: PyTorch-based Constrained Learning
☆11Jun 1, 2022Updated 4 years ago
EvanZhuang / wavspa
View on GitHub
WavSpA: Wavelet Space Attention for Enhancing Transformer's Long Sequence Learning
☆13Feb 24, 2024Updated 2 years ago
xuesongwang / Neural-Process-Family
View on GitHub
☆11May 26, 2023Updated 3 years ago
ninell-oldenburg / social-contracts
View on GitHub
☆13Mar 12, 2024Updated 2 years ago
r-three / smear
View on GitHub
☆30Sep 28, 2023Updated 2 years ago
itstyren / InteractionMARL-Coop
View on GitHub
Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24.
☆14Feb 9, 2025Updated last year
DHDev0 / Muzero-unplugged
View on GitHub
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆36Jun 25, 2025Updated last year
ha0ransun / Path-Auxiliary-Sampler
View on GitHub
☆10Feb 22, 2023Updated 3 years ago
URRealHero / JudgeAnything
View on GitHub
☆17Jun 1, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
scxue / SA-Solver
View on GitHub
Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023)
☆14Mar 4, 2024Updated 2 years ago
dilinwang820 / adaptive-f-divergence
View on GitHub
A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"
☆20Jan 11, 2019Updated 7 years ago
Bam4d / conditional-action-trees
View on GitHub
Example Code for the Conditional Action Trees Paper
☆12May 24, 2021Updated 5 years ago
SJTU-DENG-Lab / Orthogonal-Neural-operator
View on GitHub
Code for orthogonal neural operator
☆17Oct 15, 2023Updated 2 years ago
uoe-agents / derl
View on GitHub
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆26Feb 3, 2022Updated 4 years ago
KaiXIIM / dipllm
View on GitHub
This is the official implementation of the paper "DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy".
☆25Dec 19, 2025Updated 7 months ago
okyksl / flow-lp
View on GitHub
Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"
☆11Jul 13, 2021Updated 5 years ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
View on GitHub
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14Oct 7, 2024Updated last year
fidel-schaposnik / muzero
View on GitHub
Tensorflow implementation of MuZero algorithm
☆11Aug 23, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Alexander-Nasuta / graph-jsp-env
View on GitHub
A Gymnasium Environment for the Job Shop Problem Using the Disjunctive Graph Approach.
☆29May 4, 2026Updated 2 months ago
slachapelle / disentanglement_via_mechanism_sparsity
View on GitHub
☆19Jan 12, 2024Updated 2 years ago
lzy12301 / PalSB
View on GitHub
Source code for paper "Physics-aligned field reconstruction with diffusionn bridge"
☆15Feb 12, 2025Updated last year
alvarobartt / safejax
View on GitHub
Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`
☆47May 31, 2024Updated 2 years ago
OpenEarthLab / FNP
View on GitHub
[NeurIPS 2024] FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation
☆14Mar 4, 2025Updated last year
Xihaier / HiNOTE
View on GitHub
[ICML 2024] Official implementation for the paper "Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for…
☆16Nov 8, 2024Updated last year
openreview / openreview-matcher
View on GitHub
☆23Apr 1, 2026Updated 3 months ago
lingxiaoli94 / CWB
View on GitHub
Source code for "Continuous Regularized Wasserstein Barycenters" [NeurIPS 2020].
☆16Nov 4, 2020Updated 5 years ago
seunghyukoh / ReVISE
View on GitHub
☆15Aug 11, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
seharanul17 / synthetic-tabular-LLM
View on GitHub
☆16Dec 3, 2024Updated last year
LJC-FVNR / In-context-Time-Series-Predictor
View on GitHub
Implementation of the paper "In-context Time Series Predictor" (ICLR 2025)
☆16Feb 11, 2025Updated last year
probabilisticai / tropai-2024
View on GitHub
Materials of the Tropical Probabilistic AI School 2024.
☆23Feb 1, 2024Updated 2 years ago
zhuohaoyu / ORPS
View on GitHub
☆15Jul 15, 2025Updated last year
google / omnimatte-sp
View on GitHub
☆11Mar 27, 2026Updated 3 months ago
J-zin / energy-discrepancy
View on GitHub
NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models
☆18Oct 22, 2024Updated last year
ozyyshr / RAST
View on GitHub
Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)
☆22Oct 16, 2025Updated 9 months ago