TaoRuijie / Speaker-Recognition-Demo
A ResNet Speaker Recognition & Verification Demo
☆26Oct 19, 2021Updated 4 years ago
Alternatives and similar repositories for Speaker-Recognition-Demo
Users who are interested in Speaker-Recognition-Demo are comparing it to the libraries listed below.
- TTS front-end☆11Dec 19, 2018Updated 7 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)☆787Apr 11, 2024Updated last year
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆449Oct 23, 2023Updated 2 years ago
- ☆17Jan 31, 2023Updated 3 years ago
- Separately maintained Chinese TTS☆34Oct 28, 2022Updated 3 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆32Nov 6, 2020Updated 5 years ago
- Tools for downloading VoxCeleb2 dataset☆33Mar 16, 2024Updated last year
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…☆10Jul 21, 2023Updated 2 years ago
- ☆42Nov 22, 2024Updated last year
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph☆11Dec 17, 2023Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated last year
- ☆11Jun 14, 2024Updated last year
- ☆10Jun 2, 2021Updated 4 years ago
- Speaker verification task with ECAPA-TDNN model (trained on Persian dataset)☆12Sep 15, 2022Updated 3 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆15Feb 3, 2026Updated 2 weeks ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆12Mar 25, 2025Updated 10 months ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- Classify the emotions from variable-length speech segments☆11Mar 29, 2018Updated 7 years ago
- Urdu Word Segmentation using Conditional Random Fields (CRFs)☆12Oct 3, 2018Updated 7 years ago
- A Pytorch implementation of 'Progressive Neural Networks for Transfer Learning in Emotion Recognition'☆11Jul 31, 2018Updated 7 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- An evolutionary algorithm that generates an accompaniment to a given melody that consists of triad chords while following music theory ru…☆10Sep 19, 2022Updated 3 years ago
- ☆11May 9, 2023Updated 2 years ago
- All you need to get started for the Zero Speech Challenge 2017☆47Apr 23, 2019Updated 6 years ago
- Text normalization before TTS: converts digits and letters into Chinese characters☆12Apr 27, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated 2 weeks ago
- [ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"☆15Jul 6, 2024Updated last year
- Prosody Predict☆10Jan 4, 2021Updated 5 years ago
- ☆11Nov 5, 2025Updated 3 months ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 8 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆16Nov 12, 2024Updated last year
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated 11 months ago