Sentence Transformers API: An OpenAI compatible embedding API server
ā71Sep 4, 2024Updated last year
Alternatives and similar repositories for stapi
Users that are interested in stapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A RAG that can scale š§š»āš»ā11May 28, 2024Updated 2 years ago
- Open Source Text Embedding Models with OpenAI Compatible APIā168Jul 13, 2024Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologiesā21Apr 27, 2026Updated last month
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.ā29Mar 15, 2025Updated last year
- Pairwise Controlled Manifold Approximation (PaCMAP) for dimensionality reductionā20Feb 3, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI ⢠AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ā67Mar 28, 2025Updated last year
- ā10Oct 2, 2024Updated last year
- ModernBERT model optimized for Apple Neural Engine.ā33Jan 10, 2025Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpaliā2,804Mar 24, 2026Updated 2 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddingsā13May 22, 2025Updated last year
- vLLM client with minimal dependenciesā15Feb 28, 2024Updated 2 years ago
- ā17Jan 5, 2023Updated 3 years ago
- Wyoming server for using Pocket-TTS with Home Assistant or other Wyoming aware services.ā33Updated this week
- The official repo for the DanQing dataset.ā36Mar 25, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer ⢠AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- YAST - Yet Another SPLADE or Sparse Trainerā21Jun 16, 2025Updated 11 months ago
- š¤ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)ā17Mar 20, 2024Updated 2 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systemsā35Nov 21, 2025Updated 6 months ago
- Semantically Search Emojis From the Command Line!ā13Nov 26, 2023Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware iā¦ā29Mar 8, 2026Updated 2 months ago
- Chunk your text using gpt4o-mini more accuratelyā44Aug 3, 2024Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddingsā23Jun 30, 2025Updated 11 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals ā¦ā15Jul 19, 2024Updated last year
- fast-embeddings-apiā16Nov 23, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean ⢠AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2024] šø GlotCC Dataset and Piplineā20Apr 6, 2025Updated last year
- ā15May 31, 2024Updated last year
- Rest API template developed in Python with the Flask framework. The template covers user management and jwt tokens for authentication.ā20May 16, 2024Updated 2 years ago
- A blazing fast inference solution for text embeddings modelsā4,826Updated this week
- AI_Powered_Dev_Search_Engineā12Mar 10, 2024Updated 2 years ago
- Keyphrase Extraction Prototypesā15Nov 24, 2016Updated 9 years ago
- Rhythm analysis toolkit in Pythonā13Sep 29, 2023Updated 2 years ago
- A massively multilingual modern encoder language modelā140Jan 20, 2026Updated 4 months ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]ā12Jun 5, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer ⢠AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The Flutter plugin for iOS and Android to decoding QR codes.ā18May 14, 2025Updated last year
- ā63Jul 21, 2024Updated last year
- ā31Feb 2, 2024Updated 2 years ago
- Use the tokenizer in parallel to achieve superior accelerationā20Mar 21, 2024Updated 2 years ago
- gh-do is a tool to do anything using GitHub credentialsā20Apr 23, 2026Updated last month
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"ā13Jul 23, 2023Updated 2 years ago
- ā19Mar 5, 2022Updated 4 years ago