axinc-ai / whisper-export
openvino version of openai/whisper
☆12Updated 3 months ago
Alternatives and similar repositories for whisper-export:
Users that are interested in whisper-export are comparing it to the libraries listed below
- Onnx wrapper for espnet infrernce model☆159Updated 3 months ago
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- On-device speaker diarization powered by deep learning☆34Updated 2 weeks ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- ☆56Updated 2 years ago
- ☆38Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆36Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆158Updated 10 months ago
- ONNX Inference of Pyannote Segmentation☆81Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆52Updated last year
- Python bindings of speexdsp noise suppression library☆36Updated 2 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆60Updated 2 years ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.☆46Updated 3 years ago
- a lightweight voice conversion☆78Updated 4 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆78Updated last year
- Putting flows on top of neural transducers for better TTS☆63Updated this week
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆58Updated 5 months ago
- asr2k☆49Updated 7 months ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆84Updated last month
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆96Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆71Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆83Updated last month
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆114Updated 2 years ago