HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆44Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for speaker-change-detection
- Clustering-based methods for overlapping diarization☆68Updated 9 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆70Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆43Updated last month
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆34Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆81Updated last week
- ☆43Updated 9 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆83Updated 3 weeks ago
- ☆29Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆47Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆70Updated last year
- ☆56Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- A simple package for Guided source separation (GSS)☆107Updated 5 months ago
- multilingual speech aligner☆71Updated 11 months ago
- ☆69Updated last year
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆135Updated 6 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆38Updated 2 months ago
- Official repository of NeXt-TDNN for speaker verification☆54Updated last month
- Update ASR paper everyday☆30Updated this week
- A sequence-to-sequence voice conversion toolkit.☆85Updated 4 months ago
- ☆17Updated 3 months ago
- ☆27Updated 7 months ago
- SelfRemaster: SSL Speech Restoration☆84Updated 10 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆72Updated 5 months ago
- Unofficial implementation of miipher☆111Updated 6 months ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆119Updated 2 years ago
- A list of papers for child ASR☆26Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month