PINTO0309 / whisper-onnx-tensorrt
ONNX and TensorRT implementation of Whisper
☆61Updated last year
Alternatives and similar repositories for whisper-onnx-tensorrt:
Users that are interested in whisper-onnx-tensorrt are comparing it to the libraries listed below
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆69Updated 3 months ago
- ONNX implementation of Whisper. PyTorch free.☆88Updated 2 months ago
- ☆65Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆60Updated 2 years ago
- Onnx wrapper for espnet infrernce model☆159Updated 3 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- A toolkit for processing speech data and creating speech datasets☆104Updated this week
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆15Updated 8 months ago
- ☆56Updated 2 years ago
- ☆84Updated 9 months ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆34Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆40Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆83Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆78Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- ☆38Updated 3 years ago
- Audio tokenization, in the fastest way possible!☆46Updated 5 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆36Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆71Updated last year
- Implementation of Google's USM speech model in Pytorch☆27Updated this week
- Tunable pipelines☆31Updated 2 weeks ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆158Updated 10 months ago
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆84Updated last month
- Colab notebooks for Next-gen Kaldi☆26Updated last month
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- openvino version of openai/whisper☆12Updated 3 months ago
- Dippy Synthetic Speech Subnet☆15Updated this week