NVIDIA-AI-IOT / whisper_trtLinks
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆82Updated 7 months ago
Alternatives and similar repositories for whisper_trt
Users that are interested in whisper_trt are comparing it to the libraries listed below
Sorting:
- ONNX and TensorRT implementation of Whisper☆63Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆97Updated 6 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆119Updated last year
- A toolkit for processing speech data and creating speech datasets☆114Updated this week
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆47Updated 8 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆75Updated 3 weeks ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆208Updated last year
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆271Updated 7 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 9 months ago
- Using OpenVINO to speed up MeloTTS inference☆11Updated 7 months ago
- Easy to use neural networks for NVIDIA Jetson (and desktop too!)☆73Updated 2 years ago
- NVIDIA Riva runnable tutorials☆133Updated 3 weeks ago
- A Toolkit to Help Optimize Onnx Model☆148Updated last week
- EdgeSAM model for use with Autodistill.☆26Updated 11 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆56Updated 3 weeks ago
- Sample C++ command-line Riva clients.☆33Updated last week
- Riva Python client API and CLI utils☆94Updated this week
- ☆110Updated 2 months ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆109Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆27Updated 10 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- A reference example for integrating NanoOwl with Metropolis Microservices for Jetson☆30Updated 11 months ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- ☆96Updated 8 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆62Updated last month
- A collection of all our phonemeizers for dataset construction and inference☆23Updated 3 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 6 months ago
- openvino version of openai/whisper☆13Updated 7 months ago