RulinShao / massive-serve

Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.

☆23

Alternatives and similar repositories for massive-serve

Users who are interested in massive-serve are comparing it to the repositories listed below.

orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆48 Updated last year
swj0419 / in-context-pretraining
☆54 Updated last year
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22 Updated last year
CodeCreator / WebOrganizer
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆67 Updated 5 months ago
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆148 Updated last year
allenai / bff
☆39 Updated last year
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆78 Updated 2 years ago
ielab / PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆51 Updated 4 months ago
neulab / data-agora
[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆39 Updated 10 months ago
xlang-ai / BRIGHT
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆168 Updated last month
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99 Updated 2 years ago
McGill-NLP / retriever-lm-reasoning
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28 Updated last year
GAIR-NLP / benbench
Benchmarking Benchmark Leakage in Large Language Models
☆55 Updated last year
kaistAI / factual-knowledge-acquisition
☆23 Updated 2 months ago
jzbjyb / ReAtt
Retrieval as Attention
☆82 Updated 2 years ago
abhika-m / FAVA
☆74 Updated last year
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆122 Updated last year
thu-coai / PICL
Code for ACL2023 paper: Pre-Training to Learn in Context
☆107 Updated last year
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆47 Updated last year
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41 Updated 2 years ago
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆51 Updated 2 months ago
kaistAI / InstructIR
InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32 Updated last year
eladsegal / strategyqa
The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".
☆81 Updated 2 years ago
kernelmachine / silo-lm
SILO Language Models code repository
☆83 Updated last year
awslabs / rag-qa-arena
☆48 Updated last year
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94 Updated 8 months ago
OSU-NLP-Group / In-Context-Reranking
[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
☆36 Updated 6 months ago
texttron / BrowseComp-Plus
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent
☆101 Updated this week
princeton-nlp / LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
☆132 Updated last year
EleutherAI / pile_dedupe
Pile Deduplication Code
☆19 Updated 2 years ago