Model implementation for the contextual embeddings project
β47Jun 2, 2025Updated last year
Alternatives and similar repositories for contextual-embeddings
Users that are interested in contextual-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A RAG that can scale π§π»βπ»β11May 28, 2024Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ24Jun 30, 2025Updated last year
- Use contrastive learning to train a large language model (LLM) as a retrieverβ12Jul 19, 2024Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddingsβ13May 22, 2025Updated last year
- β17Nov 11, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.β48Jul 25, 2023Updated 2 years ago
- π Fine-tune OpenAI models for text classification, question answering, and moreβ17May 1, 2023Updated 3 years ago
- This repository helps you evaluate your models on the FreshStack benchmark!β34Dec 9, 2025Updated 6 months ago
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrievalβ18Feb 13, 2026Updated 4 months ago
- A massively multilingual modern encoder language modelβ143Jan 20, 2026Updated 5 months ago
- β63Jul 21, 2024Updated last year
- β57Jul 10, 2025Updated 11 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".β230Jun 20, 2026Updated last week
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"β22Mar 31, 2025Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"β13Jul 23, 2023Updated 2 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Rankingβ25Apr 4, 2025Updated last year
- Python library to use Pleias-RAG modelsβ72Jun 20, 2026Updated last week
- β10Oct 2, 2024Updated last year
- β44Jan 19, 2026Updated 5 months ago
- ADAG: Transluce's MLP neuron-level circuit tracing libraryβ29Apr 10, 2026Updated 2 months ago
- β15Jun 19, 2025Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numbaβ38Oct 16, 2025Updated 8 months ago
- β11Feb 9, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the MTEB leaderboardβ31Feb 4, 2025Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsβ45Mar 6, 2024Updated 2 years ago
- β17Jan 5, 2023Updated 3 years ago
- My NER Experiments with ModernBERT and Ettinβ29Jul 17, 2025Updated 11 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"β44Mar 31, 2025Updated last year
- Scaling Laws for Mixture of Experts Modelsβ15Feb 25, 2025Updated last year
- β15Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β161Jul 14, 2025Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Sep 19, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answeringβ38May 30, 2023Updated 3 years ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.β28Jun 23, 2026Updated last week
- code for training & evaluating Contextual Document Embedding modelsβ206May 14, 2025Updated last year
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Mar 20, 2024Updated 2 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoreticaβ¦β16Sep 4, 2025Updated 9 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systemsβ36Nov 21, 2025Updated 7 months ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware iβ¦β29Mar 8, 2026Updated 3 months ago