axinc-ai / whisper-export
OpenVINO version of openai/whisper
☆15 · Updated last year
Alternatives and similar repositories for whisper-export
Users that are interested in whisper-export are comparing it to the libraries listed below
- ONNX wrapper for espnet inference model ☆169 · Updated 4 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153 · Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆67 · Updated 3 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆177 · Updated last year
- ONNX Inference of Pyannote Segmentation ☆97 · Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆62 · Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆37 · Updated 2 years ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem ☆97 · Updated 7 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ☆203 · Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆104 · Updated last year
- ONNX and TensorRT implementation of Whisper ☆66 · Updated 2 years ago
- Finetune LLM part for spark-tts model ☆115 · Updated 9 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆92 · Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 · Updated 2 years ago
- Uses machine learning to denoise audio containing speech ☆48 · Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆265 · Updated last year
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch. ☆193 · Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ☆69 · Updated 2 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor… ☆108 · Updated 3 years ago
- This repository is a collection of TTS Models in TFLite ☆202 · Updated 4 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆126 · Updated 3 years ago
- ONNX implementation of Whisper. PyTorch free. ☆102 · Updated last year
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ☆27 · Updated 4 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 · Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆56 · Updated 7 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆86 · Updated last year
- Python bindings of speexdsp noise suppression library ☆45 · Updated 3 years ago
- Finetuning VITS Efficiently ☆33 · Updated 2 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK) ☆16 · Updated 3 years ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆269 · Updated 5 months ago