OSU-NLP-Group / In-Context-RerankingLinks

[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"

☆36

Alternatives and similar repositories for In-Context-Reranking

Users that are interested in In-Context-Reranking are comparing it to the libraries listed below

Sorting:

orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆48Updated last year
abhika-m / FAVA
☆74Updated last year
xlang-ai / BRIGHT
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆169Updated last month
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆148Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51Updated 4 months ago
princeton-nlp / LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆98Updated 10 months ago
GAIR-NLP / benbench
Benchmarking Benchmark Leakage in Large Language Models
☆55Updated last year
ytyz1307zzh / RefAug
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆55Updated last year
awslabs / rag-qa-arena
☆48Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆82Updated last year
neulab / data-agora
[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆39Updated 10 months ago
LuLuLuyi / LongHeads
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆31Updated last year
ielab / PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆51Updated 4 months ago
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆216Updated this week
qhjqhj00 / WebBrain
☆68Updated 2 years ago
orionw / rank1
Test-time compute in information retrieval
☆45Updated 3 months ago
Zayne-sprague / MuSR
☆53Updated last year
allenai / super-benchmark
☆48Updated 6 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
DAMO-NLP-SG / CaRing
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
☆40Updated last year
HKUNLP / STRING
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆78Updated 11 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆90Updated last year
facebookresearch / ReasonIR
Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".
☆205Updated 4 months ago
texttron / BrowseComp-Plus
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent
☆101Updated last week
SALT-NLP / demonstrated-feedback
☆128Updated last year
wangcunxiang / Graph-aS-Tokens
☆10Updated 10 months ago
belindal / ERASE
Code and Data for "Language Modeling with Editable External Knowledge"
☆36Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆59Updated last year