Model implementation for the contextual embeddings project
β47Jun 2, 2025Updated 10 months ago
Alternatives and similar repositories for contextual-embeddings
Users that are interested in contextual-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A RAG that can scale π§π»βπ»β11May 28, 2024Updated last year
- PreRanker: reranking tools before tool-useβ21Apr 9, 2025Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ23Jun 30, 2025Updated 10 months ago
- Use contrastive learning to train a large language model (LLM) as a retrieverβ12Jul 19, 2024Updated last year
- β16Nov 11, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.β47Jul 25, 2023Updated 2 years ago
- π Fine-tune OpenAI models for text classification, question answering, and moreβ17May 1, 2023Updated 3 years ago
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrievalβ17Feb 13, 2026Updated 2 months ago
- A massively multilingual modern encoder language modelβ140Jan 20, 2026Updated 3 months ago
- β62Jul 21, 2024Updated last year
- β57Jul 10, 2025Updated 9 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.β51Dec 7, 2024Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".β227Apr 8, 2026Updated 3 weeks ago
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"β13Jul 23, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmarkβ22Aug 22, 2025Updated 8 months ago
- Drops literature reviews in public placesβ13Mar 31, 2023Updated 3 years ago
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024β14Jun 24, 2024Updated last year
- Python package for Model Metric Uncertainty estimationβ16Sep 5, 2024Updated last year
- β39Jan 19, 2026Updated 3 months ago
- β15Jun 19, 2025Updated 10 months ago
- β11Feb 9, 2024Updated 2 years ago
- [TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 𧬠π€β10Apr 17, 2025Updated last year
- Code for the MTEB leaderboardβ30Feb 4, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- gRPC server for hnswlibβ16Mar 6, 2023Updated 3 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsβ45Mar 6, 2024Updated 2 years ago
- German dataset for DPR model trainingβ19Jul 21, 2024Updated last year
- π€ Trade any tensors over the networkβ31Sep 27, 2023Updated 2 years ago
- β17Jan 5, 2023Updated 3 years ago
- My NER Experiments with ModernBERT and Ettinβ27Jul 17, 2025Updated 9 months ago
- Scaling Laws for Mixture of Experts Modelsβ15Feb 25, 2025Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"β43Mar 31, 2025Updated last year
- β15Dec 15, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β160Jul 14, 2025Updated 9 months ago
- MIRAGE is a light benchmark to evaluate RAG performance.β37May 18, 2025Updated 11 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ198Sep 13, 2025Updated 7 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Sep 19, 2025Updated 7 months ago
- code for training & evaluating Contextual Document Embedding modelsβ203May 14, 2025Updated 11 months ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Mar 20, 2024Updated 2 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoreticaβ¦β16Sep 4, 2025Updated 7 months ago