duydvu / triton-inference-server-web-uiLinks
Triton Inference Server Web UI
☆14Updated last year
Alternatives and similar repositories for triton-inference-server-web-ui
Users that are interested in triton-inference-server-web-ui are comparing it to the libraries listed below
Sorting:
- Common source, scripts and utilities shared across all Triton repositories.☆74Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆102Updated 2 years ago
- Whisper inference with TensorRT-LLM☆22Updated last year
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- paraformer(chinense asr) online onnx runtime for python☆46Updated last year
- mnn asr demo.☆20Updated 3 months ago
- ☆267Updated 2 weeks ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Go framework for DL model inference and API deployment☆49Updated 6 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆164Updated this week
- The Triton backend for the ONNX Runtime.☆153Updated last week
- ☆31Updated 9 months ago
- The Triton backend for TensorRT.☆77Updated last week
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux☆61Updated 4 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆95Updated 9 months ago
- ☆109Updated last year
- Utilizes ONNX Runtime for audio denoising.☆55Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆256Updated 3 weeks ago
- Common source, scripts and utilities for creating Triton backends.☆328Updated last week
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆12Updated last year
- A Toolkit to Help Optimize Onnx Model☆159Updated this week
- OpenAI compatible API for TensorRT LLM triton backend☆209Updated 10 months ago
- pure go for rwkv☆19Updated last year
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆53Updated 7 months ago
- noise reduction☆17Updated 11 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆107Updated this week
- ONNX Inference of Pyannote Segmentation☆91Updated 6 months ago
- Efficient inference of large language models.☆148Updated last week
- ☆67Updated 2 years ago
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…☆64Updated 2 weeks ago