substratusai / stapiLinks

Sentence Transformers API: An OpenAI compatible embedding API server

☆64

Alternatives and similar repositories for stapi

Users that are interested in stapi are comparing it to the libraries listed below

Sorting:

Dan-wanna-M / formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
☆221Updated 2 months ago
huseinzol05 / transformers-continuous-batching
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆26Updated 4 months ago
rag-wtf / open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
☆157Updated last year
etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆42Updated last year
EveripediaNetwork / fastc
Unattended Lightweight Text Classifiers with LLM Embeddings
☆185Updated 11 months ago
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆68Updated last year
agamm / semantic-split
A Python library to chunk/group your texts based on semantic similarity.
☆97Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆88Updated 3 months ago
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 10 months ago
vespa-engine / pyvespa
Python API for https://vespa.ai, the open big data serving engine
☆135Updated this week
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
jina-ai / llm-query-expansion
Query Expension for Better Query Embedding using LLMs
☆55Updated 5 months ago
Knowledgator / LiqFit
Efficient few-shot learning with cross-encoders.
☆56Updated last year
YZ-Cai / SimGRAG
Official code of the ACL 2025 paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"
☆118Updated 2 weeks ago
QuixiAI / spectrum
☆129Updated 4 months ago
nyunAI / PruneGPT
☆51Updated last year
AlexBodner / How_Much_VRAM
☆102Updated 11 months ago
joaodsmarques / LumberChunker
This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João…
☆70Updated 10 months ago
codelion / adaptive-classifier
A flexible, adaptive classification system for dynamic text classification
☆353Updated 2 weeks ago
promptslab / LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
☆240Updated last year
mixedbread-ai / baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆186Updated 11 months ago
khaimt / qa_expert
This repo is for handling Question Answering, especially for Multi-hop Question Answering
☆67Updated last year
aurelio-labs / semantic-chunkers
☆231Updated last month
DunZhang / Stella
☆62Updated last year
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 6 months ago
michaelfeil / embed
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆45Updated 10 months ago
gusye1234 / nano-vectordb
A simple, easy-to-hack Vector Database
☆156Updated 8 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
denser-org / denser-retriever
An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.
☆287Updated last month
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆76Updated 9 months ago