Python package for serving a local search engine. One command to download and serve a datastore---that's it π.
β26Jun 6, 2025Updated 10 months ago
Alternatives and similar repositories for massive-serve
Users that are interested in massive-serve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".β225Dec 16, 2025Updated 4 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".β227Apr 8, 2026Updated last week
- Use the tokenizer in parallel to achieve superior accelerationβ20Mar 21, 2024Updated 2 years ago
- monae: multi-modal single-cell integration and imputationβ13Sep 13, 2024Updated last year
- β14Feb 13, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our focβ¦β32Jun 13, 2024Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmarkβ22Aug 22, 2025Updated 7 months ago
- Explainable deep hypergraph learning modeling the peptide secondary structure predictionβ13Feb 27, 2023Updated 3 years ago
- a pytorch version for GREMLIN, used to predict the protein contacts by coevolution method.β17May 30, 2021Updated 4 years ago
- β20Dec 14, 2024Updated last year
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"β41Aug 13, 2025Updated 8 months ago
- Berkeley OS Prelim Reading Notesβ15Sep 20, 2023Updated 2 years ago
- The GAN model for designing AMPβ17Aug 19, 2025Updated 8 months ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generationβ14Aug 19, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SILO Language Models code repositoryβ83Feb 23, 2024Updated 2 years ago
- Metal binding prediction using coevolutionβ24May 19, 2024Updated last year
- BC-Design: A Biochemistry-Aware Framework for High-Precision Inverse Protein Folding https://www.biorxiv.org/content/10.1101/2024.10.28.6β¦β21Nov 24, 2025Updated 4 months ago
- [NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverageβ16Sep 2, 2025Updated 7 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How ModelβTask Alignment Induces Divergent RL Conclusions".β17Feb 9, 2026Updated 2 months ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performanceβ17Sep 23, 2022Updated 3 years ago
- Dataset and Evaluation Code for the K-QA Benchmark.β18May 26, 2024Updated last year
- A GPT-powered AI auto scraper for websites. AI Web Scraping made easy.β14Jun 26, 2023Updated 2 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generationβ14Mar 29, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A RAG that can scale π§π»βπ»β11May 28, 2024Updated last year
- TIFMO: Textual Inference Forward-chaining MOduleβ12Apr 25, 2014Updated 11 years ago
- β11Feb 22, 2025Updated last year
- β12Nov 21, 2023Updated 2 years ago
- β12Jul 13, 2023Updated 2 years ago
- CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions (LREC-COLING 2024)β18Oct 9, 2024Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- Generate Software Bill of Materials for R Thingsβ20Feb 9, 2024Updated 2 years ago
- The codebase and some introductions of FineMed.β31Sep 11, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"β32Sep 13, 2024Updated last year
- EdgeSHAPer: Bond-Centric Shapley Value-Based Explanation Method for Graph Neural Networksβ27Mar 30, 2026Updated 3 weeks ago
- coded with and corrected by Google Anti-Gravityβ13Nov 23, 2025Updated 4 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchiβ¦β35May 24, 2024Updated last year
- β19Mar 23, 2025Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learningβ76May 25, 2025Updated 10 months ago
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.β14Dec 12, 2024Updated last year