[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆104Dec 2, 2024Updated last year
Alternatives and similar repositories for LitSearch
Users that are interested in LitSearch are comparing it to the libraries listed below.
- TOON as DSPy adapter☆25Feb 1, 2026Updated last month
- ☆67Mar 30, 2025Updated last year
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You☆18Mar 18, 2025Updated last year
- ☆20Mar 4, 2025Updated last year
- Dense hybrid representations for text retrieval☆64Apr 3, 2023Updated 2 years ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Jul 16, 2025Updated 8 months ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 3 months ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 2 months ago
- Codebase for "Linking Surface Facts to Large-Scale Knowledge Graphs" (EMNLP 2023)☆13May 8, 2024Updated last year
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 8 months ago
- ☆55Jan 15, 2026Updated 2 months ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Create a QnA bot on a pdf☆16May 27, 2023Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Aug 12, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- MASSW is a comprehensive text dataset on Multi-Aspect Summarization of Scientific Workflows. MASSW includes more than 152,000 peer-review…☆21May 16, 2025Updated 10 months ago
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.☆1,375Aug 13, 2025Updated 7 months ago
- ☆54Updated this week
- AAAI 2024, "Working Memory Capacity of ChatGPT: An Empirical Study".☆15Feb 10, 2025Updated last year
- ☆144Aug 21, 2023Updated 2 years ago
- Claude Code skills for academic papers: deep analysis, comics, summaries | 论文工艺:深度解读、漫画生成、速览总结☆18Jan 29, 2026Updated 2 months ago
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…☆27Nov 29, 2024Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆168Jan 8, 2024Updated 2 years ago
- ☆10Feb 12, 2024Updated 2 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- SciRepEval benchmark training and evaluation scripts☆85Updated this week
- ☆52Nov 27, 2024Updated last year
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆24May 2, 2025Updated 10 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 11 months ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆52Feb 27, 2025Updated last year
- Retrieval-Augmented Generation battle!☆64Mar 22, 2026Updated last week
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆94Apr 13, 2024Updated last year
- This repository contains ScholarQABench data and evaluation pipeline.☆145Aug 13, 2025Updated 7 months ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆104Updated this week
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Mar 17, 2025Updated last year