princeton-nlp/LitSearch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/princeton-nlp/LitSearch)

princeton-nlp / LitSearch

[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search

☆109

Alternatives and similar repositories for LitSearch

Users that are interested in LitSearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

haoxuan-unt2024 / llm4innovation
View on GitHub
☆15Aug 23, 2025Updated 11 months ago
du-nlp-lab / MLR-Copilot
View on GitHub
☆70Mar 30, 2025Updated last year
castorini / dhr
View on GitHub
Dense hybrid representations for text retrieval
☆65Apr 3, 2023Updated 3 years ago
neulab / ragged
View on GitHub
Retrieval Augmented Generation Generalized Evaluation Dataset
☆61Jul 16, 2025Updated last year
zirui-ray-liu / DivAug
View on GitHub
☆13Aug 25, 2021Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
TREMA-UNH / rubric-grading-workbench
View on GitHub
A Workbench for Autograding Retrieve/Generate Systems
☆15Jun 30, 2025Updated last year
nec-research / fact-linking
View on GitHub
Codebase for "Linking Surface Facts to Large-Scale Knowledge Graphs" (EMNLP 2023)
☆12May 8, 2024Updated 2 years ago
ncodepro / pdfchatbot
View on GitHub
Create a QnA bot on a pdf
☆16May 27, 2023Updated 3 years ago
ulab-uiuc / GraphEval
View on GitHub
[ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You
☆17Mar 18, 2025Updated last year
fresh-stack / freshstack
View on GitHub
This repository helps you evaluate your models on the FreshStack benchmark!
☆34Dec 9, 2025Updated 7 months ago
chentong0 / rl-binary-rar
View on GitHub
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15Nov 13, 2025Updated 8 months ago
Daniel-Gong / ChatGPT-WM
View on GitHub
AAAI 2024, "Working Memory Capacity of ChatGPT: An Empirical Study".
☆15Feb 10, 2025Updated last year
thakur-nandan / sprint
View on GitHub
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
☆48Jul 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AkariAsai / ScholarQABench
View on GitHub
This repository contains ScholarQABench data and evaluation pipeline.
☆158Aug 13, 2025Updated 11 months ago
parameterlab / leaky_thoughts
View on GitHub
Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025
☆17Jan 12, 2026Updated 6 months ago
MiuLab / PairDistill
View on GitHub
Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.
☆22Nov 28, 2024Updated last year
castorini / umbrela
View on GitHub
☆58Apr 18, 2026Updated 3 months ago
liamdugan / summary-qg
View on GitHub
Code for the ACL 2022 Paper "A Feasibility Study of Answer-Agnostic Question Generation for Education"
☆16Jul 5, 2022Updated 4 years ago
moussaKam / FrugalScore
View on GitHub
FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…
☆16Sep 21, 2022Updated 3 years ago
MTSchool / MT-paper-list-of-ACL
View on GitHub
ACL Paper Lists(machine translation)
☆13Mar 23, 2022Updated 4 years ago
McGill-NLP / instruct-qa
View on GitHub
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆87Aug 12, 2024Updated last year
jiangycTarheel / SQ-Transformer
View on GitHub
☆10Feb 12, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
chentong0 / factoid-wiki
View on GitHub
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆171Jan 8, 2024Updated 2 years ago
zirui-ray-liu / Exact
View on GitHub
☆21Mar 23, 2022Updated 4 years ago
instavm / skill-optimization
View on GitHub
Demonstration of DSPy optimization for Skill.md files
☆15Dec 28, 2025Updated 7 months ago
Archelunch / dspy-toon
View on GitHub
TOON as DSPy adapter
☆26Feb 1, 2026Updated 5 months ago
allenai / asta-paper-finder
View on GitHub
frozen-in-time version of our Paper Finder agent for reproducing evaluation results
☆244Mar 17, 2026Updated 4 months ago
RenzeLou / AAAR-1.0
View on GitHub
The source code for running LLMs on the AAAR-1.0 benchmark.
☆20Apr 5, 2025Updated last year
allenai / scirepeval
View on GitHub
SciRepEval benchmark training and evaluation scripts
☆89May 5, 2026Updated 2 months ago
GuyTevet / diversity-eval
View on GitHub
Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"
☆21Feb 23, 2021Updated 5 years ago
allenai / SciRIFF
View on GitHub
Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.
☆48Mar 17, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
OAfzal / nlp-for-peer-review
View on GitHub
☆55Nov 27, 2024Updated last year
hqsiswiliam / SPT
View on GitHub
Code for Personalized Large Language Models via Selective Prompt Tuning
☆10Jun 26, 2024Updated 2 years ago
xingjian-zhang / massw
View on GitHub
MASSW is a comprehensive text dataset on Multi-Aspect Summarization of Scientific Workflows. MASSW includes more than 152,000 peer-review…
☆22May 16, 2025Updated last year
shulin16 / MMInA
View on GitHub
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆54Feb 27, 2025Updated last year
psoulos / role-decomposition
View on GitHub
☆11Feb 11, 2020Updated 6 years ago
EagleW / Scientific-Inspiration-Machines-Optimized-for-Novelty
View on GitHub
Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty
☆95Apr 13, 2024Updated 2 years ago
unicamp-dl / ExaRanker
View on GitHub
☆29Feb 2, 2024Updated 2 years ago