dangvansam / pyannote-onnxLinks

PyAnnote Voice Activity Detection (ONNX version)

☆19

Alternatives and similar repositories for pyannote-onnx

Users that are interested in pyannote-onnx are comparing it to the libraries listed below

Sorting:

vasistalodagala / whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆319Updated 2 years ago
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆332Updated 8 months ago
zhuzilin / whisper-openvino
openvino version of openai/whisper
☆168Updated last year
Vaibhavs10 / fast-whisper-finetuning
☆527Updated last year
shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
☆437Updated 10 months ago
absadiki / pywhispercpp
Python bindings for whisper.cpp
☆276Updated 2 weeks ago
IIEleven11 / StyleTTS2FineTune
☆240Updated last month
jumon / whisper-finetuning
[WIP] Scripts for fine-tuning Whisper
☆219Updated 2 years ago
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,363Updated 5 months ago
akashmjn / tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
☆503Updated last year
pengzhendong / pyannote-onnx
ONNX Inference of Pyannote Segmentation
☆92Updated 6 months ago
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆773Updated last month
NVIDIA / NeMo-text-processing
NeMo text processing for ASR and TTS
☆347Updated this week
haoheliu / voicefixer
General Speech Restoration
☆1,186Updated 5 months ago
YuanGongND / whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …
☆396Updated last year
Blair-Johnson / batch-whisper
Batch Support for OpenAI Whisper
☆95Updated last year
anhnh2002 / XTTSv2-Finetuning-for-New-Languages
☆157Updated 7 months ago
MahmoudAshraf97 / ctc-forced-aligner
Text to speech alignment using CTC forced alignment
☆317Updated 3 months ago
gemelo-ai / vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
☆949Updated 11 months ago
KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆97Updated last month
aarnphm / whispercpp
Pybind11 bindings for Whisper.cpp
☆334Updated 7 months ago
ylacombe / finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
☆159Updated last year
RomanKlimov / faster-whisper-acceleration
Accelerating faster-whisper single file processing by multiprocessing through parallelization
☆54Updated 2 years ago
huggingface / diarizers
☆300Updated last year
PABannier / bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech generation
☆832Updated 8 months ago
MiscellaneousStuff / openai-whisper-cpu
Improving transcription performance of OpenAI Whisper for CPU based deployment
☆246Updated 2 years ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆221Updated 3 months ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆96Updated last year
davidmartinrius / speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆247Updated last year
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆220Updated last week