pengzhendong / pyannote-onnxLinks

ONNX Inference of Pyannote Segmentation

☆92

Alternatives and similar repositories for pyannote-onnx

Users that are interested in pyannote-onnx are comparing it to the libraries listed below

Sorting:

leohuang2013 / pyannote-audio_speaker-diarization_cpp
C++ version of pyannote audio speaker diarizaiton pipeline
☆21Updated last year
pengzhendong / pysilero
Python Wrapper of Silero VAD
☆57Updated 2 months ago
yuyun2000 / SpeechDenoiser
SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…
☆82Updated 11 months ago
csukuangfj / kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
☆112Updated 2 weeks ago
k2-fsa / colab
Colab notebooks for Next-gen Kaldi
☆28Updated 3 months ago
lovemefan / fsmn-vad
A enterprise-grade Voice Activity Detector from modelscope and funasr.
☆105Updated 2 years ago
joonaskalda / PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆95Updated 6 months ago
pengzhendong / g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆105Updated 4 months ago
backspacetg / simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
☆69Updated 4 months ago
BUTSpeechFIT / TS-ASR-Whisper
☆77Updated last month
wenet-e2e / wesep
Target Speaker Extraction Toolkit
☆183Updated last week
k2-fsa / ZipVoice
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆330Updated last week
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆88Updated last year
frankyoujian / Edge-Punct-Casing
☆29Updated 5 months ago
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
☆155Updated last month
FENRlR / MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
☆128Updated 8 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆100Updated 9 months ago
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆116Updated 2 years ago
BriansIDP / WhisperBiasing
☆81Updated this week
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆57Updated last year
pirxus / personalVAD
An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.
☆70Updated 2 years ago
pengzhendong / pyrnnoise
Python Wrapper for RnNoise v0.2
☆42Updated last week
SpeechColab / GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
☆164Updated last month
ScottishFold007 / TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…
☆101Updated 7 months ago
csukuangfj / kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆203Updated last month
adelacvg / ttts
Train the next generation of TTS systems.
☆165Updated 10 months ago
Audio-WestlakeU / FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆140Updated this week
bigcash / awesome-vad
A curated list of awesome voice activity detection
☆59Updated 8 months ago
uthree / tinyvc
a lightweight voice conversion
☆84Updated 11 months ago
espnet / espnet_onnx
Onnx wrapper for espnet infrernce model
☆167Updated 9 months ago