SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

☆573

Alternatives and similar repositories for AnglE

Users that are interested in AnglE are comparing it to the libraries listed below.

4AI / RAN
RAN: Recurrent Attention Networks for Long-text Modeling | Findings of ACL23
☆23 | Aug 12, 2023 | Updated 2 years ago
WhereIsAI / BiLLM
Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…
☆67 | Dec 12, 2024 | Updated last year
ContextualAI / gritlm
Generative Representational Instruction Tuning
☆697 | Jun 25, 2025 | Updated last year
staoxiao / RetroMAE
Codebase for RetroMAE and beyond.
☆275 | Jun 7, 2024 | Updated 2 years ago
FlagOpen / FlagEmbedding
Retrieval and Retrieval-augmented LLMs
☆11,979 | Apr 22, 2026 | Updated 3 months ago
4AI / AGN
Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)
☆26 | Feb 5, 2022 | Updated 4 years ago
SmartLi8 / stella
text embedding
☆145 | Sep 18, 2023 | Updated 2 years ago
4AI / LS-LLaMA
A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning
☆152 | Mar 17, 2024 | Updated 2 years ago
kongds / scaling_sentemb
Scaling Sentence Embeddings with Large Language Models
☆109 | Mar 22, 2024 | Updated 2 years ago
nomic-ai / contrastors
Train Models Contrastively in Pytorch
☆798 | Mar 26, 2025 | Updated last year
mixedbread-ai / baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆211 | Aug 31, 2024 | Updated last year
worldbank / GISTEmbed
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
☆45 | Mar 6, 2024 | Updated 2 years ago
mixedbread-ai / ofen
WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included
☆17 | Oct 2, 2024 | Updated last year
THUIR / T2Ranking
T2Ranking: A large-scale Chinese benchmark for passage ranking.
☆161 | Jul 3, 2023 | Updated 3 years ago
beir-cellar / beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,252 | Oct 16, 2025 | Updated 9 months ago
lightonai / pylate
Late Interaction Models Training & Retrieval
☆876 | Updated this week
McGill-NLP / llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
☆1,706 | Apr 4, 2026 | Updated 3 months ago
embeddings-benchmark / mteb
MTEB: State-of-the-art evaluation of embeddings across languages and modalities
☆3,368 | Updated this week
4AI / TDEER
Official Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP 2021)
☆41 | Jul 27, 2024 | Updated last year
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,626 | Dec 20, 2025 | Updated 7 months ago
castorini / rank_llm
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆610 | Updated this week
jakespringer / echo-embeddings
☆168 | Apr 17, 2024 | Updated 2 years ago
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161 | Jul 14, 2025 | Updated last year
HazyResearch / m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
☆564 | Dec 28, 2024 | Updated last year
DunZhang / Stella
☆63 | Jul 21, 2024 | Updated 2 years ago
facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆298 | Jul 2, 2026 | Updated 3 weeks ago
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 | Updated this week
hltcoe / rank-k
Repository for the listwise reranker Rank-K
☆16 | May 23, 2025 | Updated last year
webis-de / rank-distillm
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking
☆25 | Apr 4, 2025 | Updated last year
RAIVNLab / MRL
Code repository for the paper - "Matryoshka Representation Learning"
☆651 | Feb 19, 2024 | Updated 2 years ago
xlang-ai / instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
☆2,024 | Jan 15, 2025 | Updated last year
project-miracl / miracl
A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
☆211 | Jul 31, 2024 | Updated last year
huggingface / sentence-transformers
State-of-the-Art Embeddings, Retrieval, and Reranking
☆18,941 | Updated this week
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆56 | Jul 3, 2024 | Updated 2 years ago
mixedbread-ai / binary-embeddings
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…
☆19 | Mar 23, 2024 | Updated 2 years ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆50 | Nov 13, 2023 | Updated 2 years ago
hieudx149 / X-RetroMAE
Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder
☆10 | Mar 16, 2023 | Updated 3 years ago
michaelfeil / infinity
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
☆2,892 | Mar 24, 2026 | Updated 4 months ago
stanford-futuredata / ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,903 | Oct 14, 2025 | Updated 9 months ago