hcd233 / Aris-AI-Model-ServerLinks
An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API
☆18Updated 5 months ago
Alternatives and similar repositories for Aris-AI-Model-Server
Users that are interested in Aris-AI-Model-Server are comparing it to the libraries listed below
Sorting:
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 10 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆167Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆73Updated this week
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆192Updated last month
- 大模型推理框架加速,让 LLM 飞起来☆24Updated last year
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生 态以及Dify强大的模型可拓展性和Workflow的效益。☆39Updated last year
- MinerU API server☆85Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆108Updated 6 months ago
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆58Updated last year
- You can play any API server that compatible with OpenAI API☆24Updated last year
- LM inference server implementation based on *.cpp.☆295Updated 2 months ago
- A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.☆59Updated 10 months ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆32Updated last week
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆44Updated last year
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆72Updated last year
- Run LLM-related tools in containers.☆55Updated last year
- 简单的 AIGC 微服务,可通过 HTTP、gRPC 连接,支持流式回答。☆10Updated 2 years ago
- Dify 1.0 Plugin Convert your Dify tools's API to MCP compatible API☆23Updated 9 months ago
- Real time faster whisper gradio☆25Updated 5 months ago
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, Monaco Editor and more advanced components to help…☆135Updated 2 months ago
- An open-source framework for building monolithic or distributed agentic systems, ranging from simple LLM calls to compositional workflows…☆25Updated 3 weeks ago
- The CLI & python API for the well-known project gpt-academic.☆19Updated last year
- Jina DeepSearch UI☆127Updated 5 months ago
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆28Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆78Updated last year
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆156Updated last year
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated 2 years ago
- human in the loop in dify workflow by plugin☆14Updated last year
- mcp-difyworkflow-server is an mcp server Tools application that implements the query and invocation of Dify workflows, supporting the on-…☆60Updated last year
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆238Updated last month