PINTO0309 / whisper-onnx-tensorrt
ONNX and TensorRT implementation of Whisper
☆61Updated last year
Alternatives and similar repositories for whisper-onnx-tensorrt:
Users that are interested in whisper-onnx-tensorrt are comparing it to the libraries listed below
- ONNX implementation of Whisper. PyTorch free.☆92Updated 3 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆73Updated 4 months ago
- ☆73Updated last week
- A toolkit for processing speech data and creating speech datasets☆106Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆94Updated 5 months ago
- ☆56Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆65Updated 6 months ago
- Onnx wrapper for espnet infrernce model☆161Updated 5 months ago
- Package for inference for punctuation, true-casing, and sentence boundary detection☆25Updated 9 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆47Updated 8 months ago
- Nue-ASR inference code by rinna Co., Ltd.☆31Updated 7 months ago
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 9 months ago
- ☆38Updated 3 years ago
- ☆84Updated 11 months ago
- A TTS model that makes a speaker speak new languages☆76Updated 8 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- Kaldi-compatible online fbank extractor without external dependencies☆88Updated this week
- a Frontier Japanese Speech Generation net☆26Updated this week
- ☆28Updated last year
- openvino version of openai/whisper☆13Updated 5 months ago
- ONNX Inference of Pyannote Segmentation☆80Updated 2 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆61Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year