duydvu / triton-inference-server-web-ui
Triton Inference Server Web UI
☆12 · Updated last year
Alternatives and similar repositories for triton-inference-server-web-ui:
Users who are interested in triton-inference-server-web-ui are comparing it to the libraries listed below:
- Whisper inference with TensorRT-LLM ☆21 · Updated last year
- Common source, scripts and utilities shared across all Triton repositories. ☆69 · Updated last week
- Port of FunASR's Paraformer model in C/C++ ☆32 · Updated 10 months ago
- The Triton backend for TensorRT. ☆73 · Updated this week
- ☆65 · Updated 2 years ago
- The Triton backend for the ONNX Runtime. ☆140 · Updated last week
- MNN ASR demo. ☆16 · Updated last month
- Common source, scripts and utilities for creating Triton backends. ☆316 · Updated last week
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR. ☆96 · Updated 2 years ago
- ☆61 · Updated this week
- LLM deployment project based on ONNX. ☆36 · Updated 6 months ago
- ONNX Inference of Pyannote Segmentation ☆85 · Updated 4 months ago
- OpenAI-compatible API for the TensorRT-LLM Triton backend ☆205 · Updated 8 months ago
- ☆124 · Updated last year
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆199 · Updated this week
- ☆74 · Updated 2 years ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆26 · Updated this week
- ☆246 · Updated last week
- A toolkit to help optimize ONNX models ☆140 · Updated this week
- Triton backend for https://github.com/OpenNMT/CTranslate2 ☆35 · Updated last year
- ☆71 · Updated 2 years ago
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen… ☆62 · Updated last month
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆98 · Updated last week
- ASR client for Triton ASR Service ☆28 · Updated 4 months ago
- A quantization algorithm for LLMs ☆139 · Updated 10 months ago
- ☆30 · Updated 7 months ago
- A cross-platform implementation of Text-to-Speech based on ONNXRuntime. ☆32 · Updated last year
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC ☆21 · Updated 2 years ago
- ONNX and TensorRT implementation of Whisper ☆61 · Updated last year
- Kaldi-compatible online fbank extractor without external dependencies ☆94 · Updated 3 weeks ago