Sentence Transformers API: An OpenAI compatible embedding API server
☆70Sep 4, 2024Updated last year
Alternatives and similar repositories for stapi
Users that are interested in stapi are comparing it to the libraries listed below
Sorting:
- Pairwise Controlled Manifold Approximation (PaCMAP) for dimensionality reduction☆20Feb 3, 2026Updated last month
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated 11 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆166Jul 13, 2024Updated last year
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆14Aug 25, 2024Updated last year
- Rust implementation + Python bindings for image effects used in chaiNNer☆12Jun 11, 2024Updated last year
- Vscode Samge Translate 翻译助手:Quickly translate text right in your code 🚀 支持多种翻译命令(英译中、中译英、中文转多规则命名变量等),支持多种结果展示方式,支持配置百度、阿里、腾讯、火山、有道、Deep…☆14Mar 24, 2025Updated 11 months ago
- A VS Code extension that integrates Claude AI as your coding assistant using the Agent SDK.☆44Updated this week
- An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API☆18Aug 21, 2025Updated 6 months ago
- vLLM client with minimal dependencies☆15Feb 28, 2024Updated 2 years ago
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆19Sep 6, 2024Updated last year
- ☆11Feb 25, 2026Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,688Feb 5, 2026Updated 3 weeks ago
- ☆20Feb 10, 2025Updated last year
- Automation Chatbot☆21Jan 1, 2025Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 9 months ago
- Model implementation for the contextual embeddings project☆41Jun 2, 2025Updated 9 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Sep 17, 2024Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Jan 5, 2025Updated last year
- Easily share your custom workflows for anyone to run☆22Oct 17, 2024Updated last year
- A graph rag for PDFs based on langchain and Neo4j. Can fetch PDFs from Zotero Library through zotero api.☆29Jun 26, 2024Updated last year
- Build Neo4J Knowledge Graphs from Excel files☆22Nov 18, 2024Updated last year
- ModernBERT model optimized for Apple Neural Engine.☆31Jan 10, 2025Updated last year
- Embed your LLM into a python function☆22Jan 9, 2025Updated last year
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆75Updated this week
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year
- Run large models from the terminal using Apple MLX.☆31Mar 18, 2024Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Sep 20, 2024Updated last year
- Unattended Lightweight Text Classifiers with LLM Embeddings☆186Sep 6, 2024Updated last year
- ☆28Oct 14, 2024Updated last year
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- Tools and agents for automated research.☆50Dec 5, 2025Updated 2 months ago
- ☆23Updated this week
- vanna.ai demo☆31May 1, 2024Updated last year
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆30May 21, 2024Updated last year
- A simple dify bot☆34Apr 16, 2025Updated 10 months ago
- 大模型智能体Agent中文教程,博客代码仓库☆59Nov 5, 2025Updated 3 months ago
- A Model Context Protocol server for Dify☆41Feb 6, 2025Updated last year