Modular retrievers for zero-shot multilingual IR.
☆30 · Mar 6, 2024 · Updated 2 years ago
Alternatives and similar repositories for xm-retrievers
Users that are interested in xm-retrievers are comparing it to the libraries listed below.
- CLIR version of ColBERT · ☆73 · Jun 23, 2025 · Updated 9 months ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval · ☆36 · Oct 18, 2024 · Updated last year
- Fine-tune OpenAI models for text classification, question answering, and more · ☆17 · May 1, 2023 · Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings · ☆22 · Jun 30, 2025 · Updated 9 months ago
- ☆47 · Mar 27, 2022 · Updated 4 years ago
- Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering · ☆38 · May 30, 2023 · Updated 2 years ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… · ☆73 · Dec 30, 2025 · Updated 3 months ago
- AI model designed to test the effectiveness in handling external ethical attacks. · ☆11 · Feb 9, 2026 · Updated last month
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… · ☆49 · Nov 13, 2023 · Updated 2 years ago
- Code for "RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion", LREC-CO… · ☆11 · May 25, 2024 · Updated last year
- SKT A.X LLM K1 · ☆29 · Feb 11, 2026 · Updated last month
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba · ☆37 · Oct 16, 2025 · Updated 5 months ago
- ☆11 · Aug 10, 2021 · Updated 4 years ago
- ResNet: Deep Residual Learning for Image Recognition · ☆10 · Sep 18, 2021 · Updated 4 years ago
- ☆10 · Feb 9, 2024 · Updated 2 years ago
- ☆15 · Jun 10, 2024 · Updated last year
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da… · ☆15 · Mar 28, 2025 · Updated last year
- A multilingual version of MS MARCO passage ranking dataset · ☆147 · Oct 19, 2023 · Updated 2 years ago
- HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) · ☆17 · Mar 20, 2024 · Updated 2 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems · ☆35 · Nov 21, 2025 · Updated 4 months ago
- Semantically Search Emojis From the Command Line! · ☆13 · Nov 26, 2023 · Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i… · ☆29 · Mar 8, 2026 · Updated 3 weeks ago
- Cross language information retrieval pipeline · ☆19 · Jan 12, 2026 · Updated 2 months ago
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag… · ☆23 · Apr 8, 2024 · Updated last year
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals … · ☆15 · Jul 19, 2024 · Updated last year
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019) · ☆13 · Jun 21, 2022 · Updated 3 years ago
- COVID-19 Daily Data from Worldometers with Python · ☆13 · Feb 28, 2021 · Updated 5 years ago
- GlotCC Dataset and Pipeline -- NeurIPS 2024 · ☆20 · Apr 6, 2025 · Updated 11 months ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies · ☆21 · Oct 24, 2022 · Updated 3 years ago
- Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators" · ☆17 · Feb 2, 2022 · Updated 4 years ago
- Prompt Tuning on Graph-augmented Low-resource Text Classification. In TKDE 2024. · ☆15 · Jan 20, 2025 · Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings · ☆44 · Mar 6, 2024 · Updated 2 years ago
- ☆19 · Aug 9, 2024 · Updated last year
- The first high-quality, fine-grained error-correction conversation dataset between English second language learner and an educational c… · ☆15 · Aug 27, 2025 · Updated 7 months ago
- Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation · ☆19 · Aug 26, 2023 · Updated 2 years ago
- Official code and dataset repository of KoBBQ (TACL 2024) · ☆19 · May 13, 2024 · Updated last year
- Rhythm analysis toolkit in Python · ☆13 · Sep 29, 2023 · Updated 2 years ago
- PPAT: Progressive Graph Pairwise Attention Network for Event Causality Identification · ☆16 · Jun 7, 2024 · Updated last year
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) · ☆17 · Jul 16, 2024 · Updated last year