hcd233 / Aris-AI-Model-Server
An OpenAI-compatible API that integrates LLM, Embedding, and Reranker models.
☆17 Updated 2 months ago
Alternatives and similar repositories for Aris-AI-Model-Server
Users that are interested in Aris-AI-Model-Server are comparing it to the libraries listed below
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends. ☆169 Updated 3 months ago
- xllamacpp - a Python wrapper of llama.cpp ☆62 Updated last week
- Lightweight OpenAI-compatible continuous batching using HuggingFace Transformers, including T5 and Whisper. ☆29 Updated 7 months ago
- Open Source Text Embedding Models with OpenAI Compatible API ☆160 Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆105 Updated 3 months ago
- A library integrating embedding and reranker models from OpenAI, SentenceTransformers, etc., for semantic search in vector databases. ☆57 Updated 7 months ago
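- agentcp is an Agent SDK based on the ACP protocol that handles identity authentication and communication between agents. It is used to create AIDs, connect to the network, build sessions, and send and receive messages; it supports multi-agent collaboration, asynchronous message processing, NAT traversal, and load balancing of agent access. ☆17 Updated 3 months ago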
- LM inference server implementation based on *.cpp. ☆286 Updated 2 months ago
- Inference framework acceleration for large models: make LLMs fly. ☆20 Updated last year
- Jina DeepSearch UI ☆125 Updated 2 months ago
- A hybrid-inference extension plugin for vllm that supports multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill. ☆20 Updated this week
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory. ☆67 Updated last year
- This project connects Dify to OpenwebUI through a Pipeline, combining OpenwebUI's front-end strengths and ecosystem with Dify's strong model extensibility and Workflow benefits. ☆38 Updated 11 months ago
- MinerU API server ☆78 Updated 10 months ago
- Easy to deploy. A cloud service providing a Python code interpreter sandbox for Code-Interpreter. ☆55 Updated last year
- Get up and running with Llama 3, Mistral, Gemma, and other large language models. ☆30 Updated last month
- mcp-difyworkflow-server is an MCP server Tools application that implements the query and invocation of Dify workflows, supporting the on-… ☆58 Updated 10 months ago
- This Open LLM Framework serves as a powerful and flexible tool for serving endpoints for embeddings and chat completions using SOTA open … ☆22 Updated last year
- The CLI & Python API for the well-known project gpt-academic. ☆18 Updated last year
- Uses the latest graphrag interface, with local ollama providing the LLM interface. Supports installation via pip. ☆155 Updated last year
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second. ☆212 Updated 2 months ago
- Dify 1.0 plugin: convert your Dify tools' API to an MCP-compatible API ☆23 Updated 6 months ago
- "LightRAG: Simple and Fast Retrieval-Augmented Generation" ☆58 Updated 11 months ago
- Cross-platform, open-source Chinese NoteBookLM app based on Electron, using DeepSeek + Reecho.ai ☆80 Updated 11 months ago
- Run LLM-related tools in containers. ☆55 Updated last year
- Sentence Transformers API: An OpenAI compatible embedding API server ☆68 Updated last year
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, and more advanced components to help you build appl… ☆127 Updated 3 weeks ago
- This is a proof-of-concept repo on how to create a Gradio UI using the Model Context Protocol Client Python SDK. ☆66 Updated 10 months ago
- Tutorials from AutoGen Basics to Use Cases ☆32 Updated last year
- Data browser based on S3: a visualization tool for S3-based data (json / jsonl / html / md, etc.). 👇 Try online. ☆77 Updated last week