PINTO0309 / whisper-onnx-tensorrtLinks
ONNX and TensorRT implementation of Whisper
☆64Updated 2 years ago
Alternatives and similar repositories for whisper-onnx-tensorrt
Users that are interested in whisper-onnx-tensorrt are comparing it to the libraries listed below
Sorting:
- ONNX implementation of Whisper. PyTorch free.☆99Updated 10 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆90Updated 11 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆129Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- A toolkit for processing speech data and creating speech datasets☆174Updated last week
- Onnx compatible styletts2 code☆13Updated 4 months ago
- ☆133Updated 2 weeks ago
- A collection of all our phonemeizers for dataset construction and inference☆26Updated 7 months ago
- Open-source reproducible benchmarks from Argmax☆60Updated this week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆92Updated 7 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆87Updated this week
- Audio tokenization, in the fastest way possible!☆53Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆61Updated 2 years ago
- A TTS model that makes a speaker speak new languages☆76Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆57Updated last year
- Onnx wrapper for espnet infrernce model☆169Updated last month
- Nue-ASR inference code by rinna Co., Ltd.☆35Updated last week
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆264Updated last year
- Putting flows on top of neural transducers for better TTS☆64Updated this week
- Open TTS models, built for streaming on the edge☆43Updated 6 months ago
- Dippy Synthetic Speech Subnet☆17Updated 3 weeks ago
- ☆85Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆93Updated 9 months ago
- Using OpenVINO to speed up MeloTTS inference☆13Updated 11 months ago
- Implementation of Google's USM speech model in Pytorch☆31Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆149Updated last year