axinc-ai / whisper-export
openvino version of openai/whisper
☆15 Updated last year
Alternatives and similar repositories for whisper-export
Users who are interested in whisper-export are comparing it to the libraries listed below.
- ONNX Inference of Pyannote Segmentation ☆95 Updated 11 months ago
- ONNX implementation of Whisper. PyTorch free. ☆101 Updated last year
- ONNX and TensorRT implementation of Whisper ☆65 Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆63 Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆177 Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated 2 years ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆135 Updated 6 months ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem ☆97 Updated 5 months ago
- ONNX wrapper for espnet inference model ☆169 Updated 3 months ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn ☆20 Updated 4 years ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆48 Updated 11 months ago
- openvino version of openai/whisper ☆177 Updated 2 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies ☆15 Updated 11 months ago
- ☆151 Updated 3 weeks ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆67 Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT ☆57 Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆102 Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆100 Updated last year
- ☆17 Updated last year
- Zero-shot Audio Classification using Whisper ☆78 Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆91 Updated 2 years ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec. ☆119 Updated last month
- An enterprise-grade Voice Activity Detector from modelscope and funasr. ☆119 Updated 2 years ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ☆27 Updated 4 years ago
- A curated list of awesome voice activity detection ☆68 Updated last year
- finetune llm part for spark-tts model ☆110 Updated 7 months ago
- Python bindings of speexdsp noise suppression library ☆41 Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆64 Updated last week
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech ☆16 Updated last year
- ☆57 Updated last year