Sentence Transformers API: An OpenAI compatible embedding API server
☆71Sep 4, 2024Updated last year
Alternatives and similar repositories for stapi
Users that are interested in stapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open Source Text Embedding Models with OpenAI Compatible API☆167Jul 13, 2024Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆21Oct 24, 2022Updated 3 years ago
- An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API☆18Aug 21, 2025Updated 7 months ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Pairwise Controlled Manifold Approximation (PaCMAP) for dimensionality reduction☆20Feb 3, 2026Updated 2 months ago
- ☆67Mar 28, 2025Updated last year
- ☆10Oct 2, 2024Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,752Mar 24, 2026Updated 3 weeks ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 5 months ago
- RSS Launchpad web extension: quickly add new RSS/Atom subscriptions from websites☆20May 18, 2025Updated 10 months ago
- ☆17Jan 5, 2023Updated 3 years ago
- A HttpClient manager that allows cool stuff to happen☆11Jan 2, 2018Updated 8 years ago
- The official repo for the DanQing dataset.☆34Mar 25, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph☆40Jul 10, 2024Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 4 months ago
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- Chunk your text using gpt4o-mini more accurately☆44Aug 3, 2024Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 9 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Jul 19, 2024Updated last year
- fast-embeddings-api☆16Nov 23, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- Data Analytics Using MySQL☆15Mar 31, 2023Updated 3 years ago
- A blazing fast inference solution for text embeddings models☆4,663Updated this week
- Wake word detection with custom phrases without model training☆41Mar 8, 2026Updated last month
- AI Reddit Profiler is a Python tool that uses AI to analyze Reddit user profiles. It extracts information like karma, subreddit activity,…☆13Oct 23, 2024Updated last year
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- A simple modern Python project template with uv, Docker, Claude Code, Cursor, devcontainer, GitHub Actions, and pre-commit support.☆32Sep 26, 2025Updated 6 months ago
- some stuff about generative ai☆15Feb 20, 2025Updated last year
- 北语 246 实验室新生简明指南☆10May 30, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Keyphrase Extraction Prototypes☆15Nov 24, 2016Updated 9 years ago
- Rhythm analysis toolkit in Python☆13Sep 29, 2023Updated 2 years ago
- A massively multilingual modern encoder language model☆139Jan 20, 2026Updated 2 months ago
- Managing automatic patching via Python☆16Jul 10, 2024Updated last year
- A browser based CadQuery server☆12Feb 18, 2025Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Library for evaluating RAG using Nuclia's models☆18Jul 31, 2024Updated last year