[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆102Dec 2, 2024Updated last year
Alternatives and similar repositories for LitSearch
Users that are interested in LitSearch are comparing it to the libraries listed below
Sorting:
- ☆67Mar 30, 2025Updated 11 months ago
- TOON as DSPy adapter☆25Feb 1, 2026Updated last month
- Dense hybrid representations for text retrieval☆64Apr 3, 2023Updated 2 years ago
- ☆10Feb 12, 2024Updated 2 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated last year
- Codebase for "Linking Surface Facts to Large-Scale Knowledge Graphs" (EMNLP 2023)☆13May 8, 2024Updated last year
- ☆14Aug 25, 2021Updated 4 years ago
- ☆55Jan 15, 2026Updated last month
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆168Jan 8, 2024Updated 2 years ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Nov 4, 2025Updated 4 months ago
- AAAI 2024, "Working Memory Capacity of ChatGPT: An Empirical Study".☆15Feb 10, 2025Updated last year
- ExpressJS server for the GitWit React IDE.☆16May 28, 2024Updated last year
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated last year
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 8 months ago
- An R Package to process unstructured data with IBM Watson Developer Cloud Services☆13May 10, 2017Updated 8 years ago
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- Summarize the top 30 most popular arXiv papers on Reddit, Hacker News and Hugging Face in the last 30 days. And post them to Slack, Twitt…☆24Jul 5, 2025Updated 8 months ago
- ☆65Aug 14, 2024Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Sep 28, 2024Updated last year
- This repository helps you evaluate your models on the FreshStack benchmark!☆33Dec 9, 2025Updated 3 months ago
- ☆17Oct 22, 2024Updated last year
- Local LLM enabled Human terminal interaction made easy.☆18Dec 31, 2024Updated last year
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Apr 9, 2025Updated 11 months ago
- ☆19Mar 16, 2025Updated 11 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 11 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated 9 months ago
- ☆140Aug 21, 2023Updated 2 years ago
- SciRepEval benchmark training and evaluation scripts☆81Mar 3, 2026Updated last week
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆141Apr 22, 2025Updated 10 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Jan 11, 2024Updated 2 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Mar 17, 2025Updated 11 months ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago
- ☆21Jul 11, 2025Updated 7 months ago