liutaocode / DiarizationVisualization
Visualization tools for audio-only and multi-modal speaker diarization datasets
☆12Updated last year
Alternatives and similar repositories for DiarizationVisualization
Users who are interested in DiarizationVisualization are comparing it to the libraries listed below.
- Diarization Metric in One: currently supports DER, JER, CDER, SER, and BER☆9Updated 2 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆52Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆88Updated 4 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆144Updated last year
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆195Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆132Updated 2 months ago
- ☆64Updated last month
- How to use our public wav2vec2 age and gender model☆40Updated last year
- Official Repository For VoxBlink2☆67Updated 9 months ago
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆81Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆74Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆146Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆157Updated 2 years ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆134Updated 4 months ago
- Zero-Shot Emotion Style Transfer☆45Updated 3 weeks ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆219Updated 10 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆79Updated 11 months ago
- ☆71Updated last year
- Clustering-based methods for overlapping diarization☆81Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆92Updated 5 months ago
- ☆140Updated last year
- ☆69Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆99Updated 10 months ago
- Target Speaker Extraction Toolkit☆167Updated last month
- ONNX Inference of Pyannote Segmentation☆87Updated 4 months ago
- ☆74Updated 3 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆158Updated last year
- A simple package for Guided source separation (GSS)☆121Updated 11 months ago
- This is the audio sample repository for speech separation model "MossFormer2".☆125Updated 5 months ago
- SelfRemaster: SSL Speech Restoration☆88Updated last year