tmlr-group/AR-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tmlr-group/AR-Bench)

tmlr-group / AR-Bench

[ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"

☆47

Alternatives and similar repositories for AR-Bench

Users that are interested in AR-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Aboriginer / EOE
View on GitHub
[ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"
☆15Feb 15, 2025Updated last year
tmlr-group / ECON
View on GitHub
[ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"
☆39Nov 23, 2025Updated 7 months ago
tmlr-group / NoisyRationales
View on GitHub
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆40Jul 18, 2025Updated last year
tmlr-group / landscape-of-thoughts
View on GitHub
[ICLR 2026] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"
☆61May 21, 2026Updated 2 months ago
tmlr-group / CoPA
View on GitHub
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Nov 15, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
tmlr-group / EOE
View on GitHub
[ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"
☆13Feb 15, 2025Updated last year
trieu / Knowledge-Graph-Embedding
View on GitHub
Some papers on Knowledge Graph Embedding(KGE)
☆15Aug 16, 2022Updated 3 years ago
Aboriginer / ZS-NTTA
View on GitHub
[ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"
☆16Feb 22, 2025Updated last year
tmlr-group / G-effect
View on GitHub
[ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"
☆16Feb 27, 2025Updated last year
tmlr-group / RGIB
View on GitHub
[NeurIPS 2023] "Combating Bilateral Edge Noise for Robust Link Prediction"
☆11Nov 3, 2023Updated 2 years ago
tmlr-group / Co-rewarding
View on GitHub
[ICLR 2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"
☆58Feb 4, 2026Updated 5 months ago
AndrewZhou924 / RGIB
View on GitHub
[NeurIPS 2023] Combating Bilateral Edge Noise for Robust Link Prediction
☆13Nov 3, 2023Updated 2 years ago
junchaoIU / DetectRL
View on GitHub
[NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
☆16Nov 19, 2024Updated last year
warriors-30 / SFAT-paddle
View on GitHub
☆24Jun 28, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
NLP2CT / NLPCC-2025-Task1
View on GitHub
NLPCC-2025 Shared-Task 1: LLM-Generated Text Detection
☆18Apr 6, 2026Updated 3 months ago
NLP2CT / RepreGuard
View on GitHub
[TACL 2025] RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
☆16Jan 27, 2026Updated 5 months ago
tmlr-group / ZS-NTTA
View on GitHub
[ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"
☆13Feb 22, 2025Updated last year
KID-22 / LLM-IR-Bias-Fairness-Survey
View on GitHub
This is the repo for the survey of Bias and Fairness in IR with LLMs.
☆63Sep 4, 2025Updated 10 months ago
uw-nsl / safechain
View on GitHub
[ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
☆30Apr 2, 2025Updated last year
resistzzz / Co-rewarding
View on GitHub
[ICLR2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"
☆30Feb 4, 2026Updated 5 months ago
Dongping-Chen / MixSet
View on GitHub
(NAACL 2024) Official code repository for Mixset.
☆27Dec 4, 2024Updated last year
yzhao062 / auditable
View on GitHub
Audit any agent decision across its past, present, and future, on one typed graph.
☆16Updated this week
ZFancy / awesome-activation-engineering
View on GitHub
A curated list of resources for activation engineering
☆140Oct 2, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zjukg / CCKS2024_CGQA
View on GitHub
☆11May 17, 2024Updated 2 years ago
DaDaMrX / AutoTOD
View on GitHub
Official repository of the ACL 2024 paper "Rethinking Task-Oriented Dialogue Systems: From Complex Modularity to Zero-Shot Autonomous Age…
☆20May 28, 2024Updated 2 years ago
XMUDeepLIT / SSR
View on GitHub
Code for "Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal" (ACL 2024)
☆17Oct 21, 2024Updated last year
dyne-submission / dynamics-aware-embeddings
View on GitHub
☆16Sep 25, 2019Updated 6 years ago
fjc2005 / DetectAnyLLM
View on GitHub
[ACMMM 2025] Official Code of DetectAnyLLM: Towards Generalizable and Robust Detection of Machine-Generated Text Across Domains and Model…
☆26Sep 23, 2025Updated 9 months ago
yuandong-tian / understanding
View on GitHub
Understanding deep networks and large models.
☆30Jan 23, 2026Updated 5 months ago
ArikReuter / ICL_for_Full_Bayesian_Inference
View on GitHub
This repository contains the code for the paper "Can Transformers Learn Full Bayesian Inference In Context?"
☆16Apr 6, 2025Updated last year
tmlr-group / DeepInception
View on GitHub
[arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker"
☆176Feb 20, 2024Updated 2 years ago
NickyFot / ACMMM22_LearningLabelRelationships
View on GitHub
☆11Jun 20, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
robjsliwa / llama-agent
View on GitHub
Fun project to run your own LLM chat bot using llama.cpp
☆11Jun 9, 2023Updated 3 years ago
XueJiang16 / NegLabel
View on GitHub
[ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"
☆30Oct 23, 2024Updated last year
Egg-Hu / LoRA-Recycle
View on GitHub
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14Jun 20, 2025Updated last year
ZIB-IOL / SMS
View on GitHub
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 9 months ago
THU-BPM / RAPL
View on GitHub
Code and data for EMNLP 2023 paper "RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction"
☆18Mar 6, 2024Updated 2 years ago
KiddoZhu / auto-paper-list
View on GitHub
A Python tool for automatic constructing and downloading paper list
☆16Oct 16, 2019Updated 6 years ago
SELGroup / SeSE
View on GitHub
This repository contains the code implementation for the paper "SeSE: Black-Box Uncertainty Quantification for Large Language Models Base…
☆18Jul 6, 2026Updated 2 weeks ago