mlbio-epfl/LaMer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mlbio-epfl/LaMer)

mlbio-epfl / LaMer

[ICLR 2026] Meta-RL Induces Exploration in Language Agents

☆45

Alternatives and similar repositories for LaMer

Users that are interested in LaMer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eth-medical-ai-lab / smmile
View on GitHub
[NeurIPS Datasets & Benchmarks 2025] SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning
☆15Dec 2, 2025Updated 7 months ago
thu-nics / MARSHAL
View on GitHub
[ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
☆54Apr 17, 2026Updated 3 months ago
ec2604 / ContraBAR
View on GitHub
☆13May 21, 2023Updated 3 years ago
ml-jku / plstm_experiments
View on GitHub
☆16Oct 21, 2025Updated 9 months ago
hanningzhang / ER-PRM
View on GitHub
☆20Dec 14, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
usail-hkust / Agent-Omit
View on GitHub
Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Reinforcement Learning
☆32May 11, 2026Updated 2 months ago
2187Nick / ADAS
View on GitHub
Automated Design of Agentic Systems
☆10Sep 7, 2024Updated last year
zelaix / VS-Bench
View on GitHub
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
☆25Sep 30, 2025Updated 9 months ago
sitaocheng / DERL
View on GitHub
The code repo for the paper "Differentiable Evolutionary Reinforcement Learning"
☆18Jan 6, 2026Updated 6 months ago
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
youngsoul0731 / FLORA-Bench
View on GitHub
[Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances
☆20Jan 15, 2026Updated 6 months ago
syr-cn / ReMemR1
View on GitHub
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents
☆43Apr 13, 2026Updated 3 months ago
langfengQ / verl-agent
View on GitHub
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,158Jun 9, 2026Updated last month
EleutherAI / deep-ignorance
View on GitHub
☆20Jan 7, 2026Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
NagisaZj / MetaCURE-Public
View on GitHub
☆15Apr 5, 2023Updated 3 years ago
Stanford-ILIAD / Diverse-Conventions
View on GitHub
Exploring techniques to generate diverse conventions in multi-agent settings
☆16Nov 14, 2023Updated 2 years ago
daochenzha / autosmote
View on GitHub
[CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification
☆10Mar 20, 2023Updated 3 years ago
MIRALab-USTC / RL-SPF
View on GitHub
a representation learning method that predicts the Fourier transform of state sequences to improve sample efficiency of RL algorithms.
☆20Oct 26, 2023Updated 2 years ago
UT-Austin-RPL / amago
View on GitHub
off-policy RL on long sequences
☆169May 29, 2026Updated 2 months ago
nabenabe0928 / meta-learn-tpe
View on GitHub
[IJCAI'23] Speeding Up Multi-Objective Hyperparameter Optimization by Task Similarity-Based Meta-Learning for the Tree-Structured Parzen …
☆10Apr 24, 2026Updated 3 months ago
alchemistyzz / PeRL
View on GitHub
[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"
☆30Mar 30, 2026Updated 3 months ago
microsoft / SuperRL
View on GitHub
☆15Sep 8, 2025Updated 10 months ago
zhangxy-2019 / RetroAgent
View on GitHub
RETROAGENT: From Solving to Evolving via Retrospective Dual Intrinsic Feedback
☆26Mar 30, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
microsoft / webgym
View on GitHub
This project includes code for using the AsyncWebRL and WebGym frameworks to train web agent models.
☆46Jun 9, 2026Updated last month
bolt-research / popgym-arcade
View on GitHub
Atari-style POMDPs
☆34Updated this week
mctaylorpants / builtonrails.com
View on GitHub
Showcasing the power of Ruby on Rails.
☆12Jun 7, 2020Updated 6 years ago
WillDreamer / T2PO
View on GitHub
【ICML2026 Spotlight】 T2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning
☆51May 27, 2026Updated 2 months ago
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated 6 months ago
zzwjames / DPGBA
View on GitHub
An official implementation of "Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective" (KDD 2024)
☆12Sep 16, 2024Updated last year
sjtu-mvasl-robotics / AnyBipe
View on GitHub
☆20Sep 20, 2024Updated last year
jjiantong / FastBO
View on GitHub
[CVPR 2024] Efficient Hyperparameter Optimization with Adaptive Fidelity Identification
☆12Jul 12, 2024Updated 2 years ago
automl / MODNAS
View on GitHub
Official Repo for "Multi-objective Differentiable Neural Architecture Search"
☆13Jul 12, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jinpeng0528 / SEFE
View on GitHub
☆13May 6, 2025Updated last year
rujiewu / Bongard-OpenWorld
View on GitHub
This is the official code implementation of Bongard-OpenWorld (ICLR 2024).
☆14Jan 6, 2025Updated last year
Ruiyang-061X / VL-Uncertainty
View on GitHub
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
☆56Mar 18, 2025Updated last year
fmi-basel / latent-predictive-learning
View on GitHub
Code to accompany our paper "The combination of Hebbian and predictive plasticity learns invariant object representations in deep sensory…
☆32Jan 14, 2025Updated last year
DDVD233 / QoQ_Med
View on GitHub
☆52Jul 31, 2025Updated 11 months ago
machinelearningnuremberg / DyHPO
View on GitHub
[NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations
☆14Apr 25, 2023Updated 3 years ago
DeepSoftwareAnalytics / Telly
View on GitHub
Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
☆23Apr 9, 2023Updated 3 years ago