π Modular retrievers for zero-shot multilingual IR.
β30Mar 6, 2024Updated 2 years ago
Alternatives and similar repositories for xm-retrievers
Users that are interested in xm-retrievers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CLIR version of ColBERTβ73Jun 23, 2025Updated 10 months ago
- π Fine-tune OpenAI models for text classification, question answering, and moreβ17May 1, 2023Updated 3 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ23Jun 30, 2025Updated 10 months ago
- β47Mar 27, 2022Updated 4 years ago
- Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answeringβ38May 30, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contβ¦β74Dec 30, 2025Updated 4 months ago
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embeddingβ24Aug 21, 2024Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laβ¦β49Nov 13, 2023Updated 2 years ago
- β12Nov 22, 2024Updated last year
- Code for "RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion", LREC-COβ¦β11May 25, 2024Updated last year
- β10Oct 2, 2024Updated last year
- SKT A.X LLM K1β29Feb 11, 2026Updated 2 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numbaβ38Oct 16, 2025Updated 6 months ago
- β11Aug 10, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- π§ ResNet: Deep Residual Learning for Image Recognitionβ10Sep 18, 2021Updated 4 years ago
- β11Feb 9, 2024Updated 2 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddingsβ13May 22, 2025Updated 11 months ago
- A small MNIST-like The Simpsons character database to at least have some fun while training neural networks.β12May 12, 2021Updated 4 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many daβ¦β15Mar 28, 2025Updated last year
- A multilingual version of MS MARCO passage ranking datasetβ147Oct 19, 2023Updated 2 years ago
- ιη¨η₯θ―εΎθ°±εδΈδΈζζ£η΄’ζΎθζι«δΏ‘ζ―ζ£η΄’ηη²ΎεΊ¦β10Oct 30, 2024Updated last year
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systemsβ35Nov 21, 2025Updated 5 months ago
- Semantically Search Emojis From the Command Line!β13Nov 26, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware iβ¦β29Mar 8, 2026Updated 2 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β82Mar 18, 2024Updated 2 years ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024β69Oct 21, 2025Updated 6 months ago
- Cross language information retrieval pipelineβ19Jan 12, 2026Updated 3 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals β¦β15Jul 19, 2024Updated last year
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019)β13Jun 21, 2022Updated 3 years ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"β14Aug 13, 2025Updated 8 months ago
- [NeurIPS 2024] πΈ GlotCC Dataset and Piplineβ20Apr 6, 2025Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologiesβ21Apr 27, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"β17Feb 2, 2022Updated 4 years ago
- Codebase of ACL2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?βA Case Study on Open Domain Quesβ¦β16Jun 4, 2024Updated last year
- Electronic Arts (EA) NLP Assignment for: Associate Data Scientistβ13Aug 20, 2024Updated last year
- Prompt Tuning on Graph-augmented Low-resource Text Classification. In TKDE 2024.β15Jan 20, 2025Updated last year
- β19Aug 9, 2024Updated last year
- β17Nov 14, 2022Updated 3 years ago
- The first high-quality, fine-grained error-correction conversation dataset between English second language learner and an educational cβ¦β15Aug 27, 2025Updated 8 months ago