duydvu / triton-inference-server-web-uiLinks
Triton Inference Server Web UI
☆17Updated 2 years ago
Alternatives and similar repositories for triton-inference-server-web-ui
Users that are interested in triton-inference-server-web-ui are comparing it to the libraries listed below
Sorting:
- OpenAI compatible API for TensorRT LLM triton backend☆218Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆177Updated 4 months ago
- mnn asr demo.☆23Updated 8 months ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆205Updated 4 months ago
- Common source, scripts and utilities shared across all Triton repositories.☆77Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆73Updated 4 months ago
- OpenAI compatible API for open source LLMs☆16Updated 2 years ago
- ☆317Updated this week
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- vLLM Router☆51Updated last year
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆277Updated 2 years ago
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI☆49Updated 5 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆88Updated this week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆106Updated last month
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆54Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆119Updated 2 years ago
- ☆113Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 3 months ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆71Updated last year
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆163Updated last year
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 9 months ago
- ☆64Updated 8 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆79Updated last year
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆30Updated 8 months ago
- Whisper inference with TensorRT-LLM☆23Updated 2 years ago
- Comparison of Language Model Inference Engines☆236Updated 11 months ago
- Utilizes ONNX Runtime to transcribe audio into text.☆59Updated last week
- qwen2 and llama3 cpp implementation☆48Updated last year
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆61Updated last year