illuin-tech/contextual-embeddings

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/illuin-tech/contextual-embeddings)

illuin-tech / contextual-embeddings

Model implementation for the contextual embeddings project

☆47

Alternatives and similar repositories for contextual-embeddings

Users that are interested in contextual-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rasyosef / splade-index
View on GitHub
Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba
☆38Oct 16, 2025Updated 9 months ago
jataware / XRR2
View on GitHub
Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark
☆22Aug 22, 2025Updated 10 months ago
pau-mensa / xtr-warp-rs
View on GitHub
High performance implementation of the WARP (SIGIR'25) retrieval engine.
☆35May 21, 2026Updated last month
hltcoe / rank-k
View on GitHub
Repository for the listwise reranker Rank-K
☆16May 23, 2025Updated last year
oceanumeric / EnteRAG
View on GitHub
A RAG that can scale 🧑🏻‍💻
☆11May 28, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hseb-benchmark / hseb
View on GitHub
HSEB: Hybrid Search Engine Benchmark
☆21Oct 5, 2025Updated 9 months ago
HansiZeng / scaling-retriever
View on GitHub
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22Mar 31, 2025Updated last year
webis-de / rank-distillm
View on GitHub
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking
☆25Apr 4, 2025Updated last year
orionw / rank1
View on GitHub
Test-time compute in information retrieval
☆59Jul 8, 2025Updated last year
ielab / Starbucks
View on GitHub
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆25Jun 30, 2025Updated last year
texttron / AgentIR
View on GitHub
AgentIR is a retriever specialized for Deep Research agents.
☆62Apr 16, 2026Updated 3 months ago
recombee / CompresSAE
View on GitHub
Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
☆39Nov 21, 2025Updated 7 months ago
embeddings-benchmark / results
View on GitHub
Data for the MTEB leaderboard
☆58Updated this week
fresh-stack / freshstack
View on GitHub
This repository helps you evaluate your models on the FreshStack benchmark!
☆34Dec 9, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liuqi6777 / pe_rank
View on GitHub
Leveraging passage embeddings for efficient listwise reranking with large language models.
☆51Dec 7, 2024Updated last year
voidism / EAR
View on GitHub
Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
☆38May 30, 2023Updated 3 years ago
urchade / EnriCo
View on GitHub
EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction
☆26May 22, 2024Updated 2 years ago
LeeSureman / E5-Retrieval-Reproduction
View on GitHub
Use contrastive learning to train a large language model (LLM) as a retriever
☆12Jul 19, 2024Updated 2 years ago
knowledgeable-embedding / knowledgeable-embedding
View on GitHub
Knowledgeable Embedding: Injecting dynamically updatable entity knowledge into embeddings to enhance RAG
☆15Aug 31, 2025Updated 10 months ago
mixedbread-ai / maxsim-cpu
View on GitHub
☆57Jul 10, 2025Updated last year
mixedbread-ai / batched
View on GitHub
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161Jul 14, 2025Updated last year
lightonai / fast-plaid
View on GitHub
High-Performance Engine for Multi-Vector Search
☆268May 28, 2026Updated last month
lightonai / pylate-rs
View on GitHub
PyLate efficient inference engine
☆87Jan 7, 2026Updated 6 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
omarkamali / borgllm
View on GitHub
A zero-config OpenAI client with support for 20+ providers, API key rotation, rate limits, optional LangChain integration and more.
☆19Dec 11, 2025Updated 7 months ago
mjeensung / xtr-pytorch
View on GitHub
☆19May 16, 2024Updated 2 years ago
vectara / FaithJudge
View on GitHub
☆18Nov 11, 2025Updated 8 months ago
catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
ant-louis / xm-retrievers
View on GitHub
🌏 Modular retrievers for zero-shot multilingual IR.
☆30Mar 6, 2024Updated 2 years ago
JHU-CLSP / ettin-encoder-vs-decoder
View on GitHub
State-of-the-art paired encoder and decoder models (17M-1B params)
☆74Aug 6, 2025Updated 11 months ago
feyninc / tokie
View on GitHub
🍡 30x faster tokenization for every HuggingFace model
☆47May 28, 2026Updated last month
AIR-Bench / AIR-Bench
View on GitHub
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆167Mar 29, 2026Updated 3 months ago
Pleias / Pleias-RAG-Library
View on GitHub
Python library to use Pleias-RAG models
☆72Jul 1, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
DunZhang / Jasper-Token-Compression-Training
View on GitHub
The training codes of Jasper-Token-Compression-600M
☆20Nov 19, 2025Updated 8 months ago
alvarobartt / opentrain
View on GitHub
🚂 Fine-tune OpenAI models for text classification, question answering, and more
☆17May 1, 2023Updated 3 years ago
illuin-tech / modernvbert
View on GitHub
ModernVBERT is a 250M-parameter vision–language encoder that aligns a text-encoder (Ettin-150M) with a vision-encoder (SigLIP2-B) through…
☆16Oct 16, 2025Updated 9 months ago
thakur-nandan / sprint
View on GitHub
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
☆48Jul 25, 2023Updated 2 years ago
stephantul / pynife
View on GitHub
Nearly Inference Free Embeddings: make your RAG queries 500x faster
☆80Apr 27, 2026Updated 2 months ago
qanastek / DrBERT
View on GitHub
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains
☆22Feb 7, 2024Updated 2 years ago
marqo-ai / GCL
View on GitHub
Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…
☆76Updated this week