duydvu / triton-inference-server-web-ui
Triton Inference Server Web UI
☆15Updated last year
Alternatives and similar repositories for triton-inference-server-web-ui
Users who are interested in triton-inference-server-web-ui are comparing it to the libraries listed below.
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆156Updated last month
- Local knowledge-base question answering for the Sophon BM1684X, based on Langchain and language models such as ChatGLM☆13Updated last year
- OpenAI compatible API for TensorRT LLM triton backend☆214Updated last year
- Port of Funasr's Paraformer model in C/C++☆34Updated last year
- Whisper inference with TensorRT-LLM☆22Updated last year
- mnn asr demo.☆23Updated 5 months ago
- An open source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆68Updated this week
- ☆112Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆110Updated 2 years ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆29Updated 5 months ago
- ☆293Updated 3 weeks ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆15Updated 3 weeks ago
- Common source, scripts and utilities shared across all Triton repositories.☆76Updated 3 weeks ago
- Golang web client for Ollama, fast and easy to use.☆29Updated last month
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆86Updated last year
- OpenAI compatible API for open source LLMs☆16Updated last year
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆272Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆30Updated last year
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆38Updated 5 months ago
- Self-hosted huggingface mirror service.☆189Updated last month
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- LM inference server implementation based on *.cpp.☆271Updated 2 weeks ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆71Updated last year
- Comparison of Language Model Inference Engines☆229Updated 8 months ago
- ☆61Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆69Updated last year
- The Triton backend for the ONNX Runtime.☆159Updated this week
- SenseVoice-python: An enterprise-grade open source multi-language ASR system from funasr, running on onnxruntime☆99Updated 11 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆176Updated 2 months ago
- ASR using the OpenAI-compatible API `v1/audio/transcriptions`, as offered by providers like Groq and SiliconFlow☆32Updated last year