NVIDIA-AI-IOT / whisper_trt
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆74Updated 5 months ago
Alternatives and similar repositories for whisper_trt:
Users that are interested in whisper_trt are comparing it to the libraries listed below
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- ONNX implementation of Whisper. PyTorch free.☆92Updated 4 months ago
- A toolkit for processing speech data and creating speech datasets☆106Updated this week
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆86Updated 2 weeks ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆49Updated 9 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- ☆101Updated this week
- Riva Python client API and CLI utils☆88Updated this week
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆16Updated 10 months ago
- openvino version of openai/whisper☆13Updated 5 months ago
- A Toolkit to Help Optimize Onnx Model☆125Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Sample C++ command-line Riva clients.☆32Updated this week
- NVIDIA Riva runnable tutorials☆127Updated this week
- Collection of Open Source Speech Data☆152Updated 4 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆203Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- A collection of reference AI microservices and workflows for Jetson Platform Services☆38Updated last month
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆162Updated last year
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆251Updated 5 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆94Updated 5 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆62Updated 7 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆16Updated 11 months ago
- Model compression for ONNX☆87Updated 4 months ago
- ☆78Updated this week
- Triton backend for https://github.com/OpenNMT/CTranslate2☆34Updated last year
- a Frontier Japanese Speech Generation net☆27Updated last week
- Kyutai with an "eye"☆104Updated this week
- Open TTS models, built for streaming on the edge☆38Updated last week