liutaocode / DiarizationVisualization
Visualization tools for audio-only and multi-modal speaker diarization datasets
☆12Updated last year
Alternatives and similar repositories for DiarizationVisualization
Users who are interested in DiarizationVisualization are comparing it to the repositories listed below.
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆94Updated 5 months ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆52Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆138Updated 3 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆93Updated 7 months ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆221Updated 11 months ago
- ☆66Updated 9 months ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆202Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆146Updated last year
- Official Repository For VoxBlink2☆73Updated 10 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆98Updated 9 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆84Updated last year
- Zero-Shot Emotion Style Transfer☆47Updated 2 months ago
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆92Updated 6 months ago
- Diarization Metric in One: currently supports DER, JER, CDER, SER, and BER☆9Updated 2 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆146Updated last year
- SelfRemaster: SSL Speech Restoration☆89Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆157Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆133Updated 2 years ago
- Target Speaker Extraction Toolkit☆175Updated 2 months ago
- Training code for FAcodec presented in NaturalSpeech3☆212Updated 10 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch