SmartLi8/stella


text embedding

☆145

Alternatives and similar repositories for stella

Users who are interested in stella are comparing it to the libraries listed below.

OpenSenseNova / piccolo-embedding
Code for the piccolo embedding model from SenseTime
☆143 · May 21, 2024 · Updated 2 years ago
THUIR / T2Ranking
T2Ranking: A large-scale Chinese benchmark for passage ranking.
☆161 · Jul 3, 2023 · Updated 3 years ago
wangyuxinwhy / uniem
Unified embedding model
☆876 · Sep 1, 2023 · Updated 2 years ago
amulil / vector_by_onnxmodel
Accelerates vector generation by using an ONNX model
☆18 · Jan 23, 2024 · Updated 2 years ago
FlagOpen / FlagEmbedding
Retrieval and Retrieval-augmented LLMs
☆11,960 · Apr 22, 2026 · Updated 3 months ago
SeanLee97 / AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
☆573 · Mar 22, 2026 · Updated 4 months ago
NovaSearch-Team / RAG-Retrieval
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
☆1,126 · Updated this week
GioGioBond / NBCEonChatGLM6b
(NBCE) Naive Bayes-based Context Extension on ChatGLM-6b
☆15 · Jun 7, 2023 · Updated 3 years ago
izhx / uni-rep
Code for embedding and retrieval research.
☆16 · Oct 24, 2023 · Updated 2 years ago
staoxiao / RetroMAE
Codebase for RetroMAE and beyond.
☆275 · Jun 7, 2024 · Updated 2 years ago
DunZhang / Stella
☆63 · Jul 21, 2024 · Updated 2 years ago
Alibaba-NLP / Multi-CPR
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
☆204 · Jan 4, 2023 · Updated 3 years ago
worldbank / GISTEmbed
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
☆45 · Mar 6, 2024 · Updated 2 years ago
sfzhou5678 / PolyEncoder
An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate …
☆248 · Jun 12, 2023 · Updated 3 years ago
embeddings-benchmark / mteb
MTEB: State-of-the-art evaluation of embeddings across languages and modalities
☆3,363 · Updated this week
liuqi6777 / pe_rank
Leveraging passage embeddings for efficient listwise reranking with large language models.
☆51 · Dec 7, 2024 · Updated last year
thunlp / Adaptive-Note
☆60 · Oct 18, 2024 · Updated last year
yuanzhoulvpi2017 / quick_sentence_transformers
sentence-transformers to ONNX: makes SBERT model inference faster
☆166 · Mar 11, 2022 · Updated 4 years ago
ContextualAI / gritlm
Generative Representational Instruction Tuning
☆697 · Jun 25, 2025 · Updated last year
mukhal / PromptRank
[ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting
☆27 · Oct 19, 2025 · Updated 9 months ago
IAAR-Shanghai / CRUD_RAG
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
☆399 · May 20, 2025 · Updated last year
yuanzhoulvpi2017 / SentenceEmbedding
☆121 · Jun 30, 2024 · Updated 2 years ago
luyug / Reranker
Build Text Rerankers with Deep Language Models
☆265 · Feb 20, 2024 · Updated 2 years ago
smallporridge / AssistRAG
☆23 · Jan 3, 2025 · Updated last year
WhereIsAI / BiLLM
Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…
☆67 · Dec 12, 2024 · Updated last year
kanekomasahiro / eb-gec
☆15 · Mar 15, 2022 · Updated 4 years ago
pku0xff / CC-Riddle
Data for paper "CC-Riddle: A Question Answering Dataset of Chinese Character Riddles": https://arxiv.org/abs/2206.13778
☆21 · Aug 19, 2023 · Updated 2 years ago
netease-youdao / BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
☆1,881 · Sep 9, 2025 · Updated 10 months ago
CLUEbenchmark / QBQTC
QBQTC: a large-scale search matching dataset
☆86 · Dec 12, 2021 · Updated 4 years ago
mohsinulkabir14 / DEPTWEET
This repository contains the dataset 'DEPTWEET', published in the journal Computers in Human Behavior.
☆12 · Jul 12, 2023 · Updated 3 years ago
CLUEbenchmark / SimCLUE
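A dataset of 3,000,000+ examples for semantic understanding and matching. It can be used for unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for Chinese.
☆313 · Oct 11, 2022 · Updated 3 years ago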
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 · Updated this week
RUC-NLPIR / SmartSearch
☆45 · Jan 19, 2026 · Updated 6 months ago
hellonlp / sentence-similarity
Text similarity, semantic vectors, text vectors, text-similarity, similarity, sentence-similarity, BERT, SimCSE, BERT-Whitening, Sentence-BERT, PromCSE, SBERT
☆77 · Nov 26, 2024 · Updated last year
zhp510730568 / bert-ad
BERT multi-GPU training and pre-training
☆29 · Apr 12, 2020 · Updated 6 years ago
LC1332 / Luotuo-Text-Embedding
Luotuo Embedding (骆驼嵌入) is a text embedding model developed by 李鲁鲁, 冷子昂, 陈启源, 蒟蒻, and others.
☆265 · Aug 25, 2023 · Updated 2 years ago
microsoft / MoPQ
☆13 · Nov 26, 2021 · Updated 4 years ago
NEUIR / ExpandR
[EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"
☆40 · Aug 13, 2025 · Updated 11 months ago
shibing624 / text2vec
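text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.
☆4,974 · Feb 14, 2026 · Updated 5 months ago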