pengzhendong / pyannote-onnx
ONNX Inference of Pyannote Segmentation
☆86Updated 4 months ago
Alternatives and similar repositories for pyannote-onnx:
Users that are interested in pyannote-onnx are comparing it to the libraries listed below
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Python Wrapper of Silero VAD☆51Updated 2 weeks ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆73Updated 8 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆99Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month
- Predicts the level of noise and reverberation on your audiofiles☆148Updated 11 months ago
- ☆62Updated 2 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆87Updated 3 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆131Updated 2 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆63Updated last month
- Colab notebooks for Next-gen Kaldi☆27Updated 3 weeks ago
- ☆26Updated 3 months ago
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆92Updated 5 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆97Updated this week
- Target Speaker Extraction Toolkit☆164Updated 3 weeks ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆67Updated 2 years ago
- Python Wrapper for RnNoise v0.2☆31Updated 2 weeks ago
- Fine-Tune Whisper with Transformers and PEFT☆55Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆126Updated 5 months ago
- Chinese and English Bilinguish G2P☆21Updated last year
- Utilizes ONNX Runtime for audio denoising.☆45Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆153Updated last month
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆93Updated 4 months ago
- Went online decode demo☆29Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆137Updated 4 months ago
- ASR client for Triton ASR Service☆28Updated 4 months ago
- This is the audio sample repository for speech separation model "MossFormer2".☆123Updated 5 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆79Updated 11 months ago