wq2012 / SpeakerRecognitionFromScratchView external linksLinks
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
☆47May 7, 2024Updated last year
Alternatives and similar repositories for SpeakerRecognitionFromScratch
Users that are interested in SpeakerRecognitionFromScratch are comparing it to the libraries listed below
Sorting:
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- A simple implementation for improving CosyVoice2 by GRPO method☆32Oct 17, 2025Updated 3 months ago
- ☆15Apr 2, 2025Updated 10 months ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- A selective noise filter architecture driven by a CNN and Wiener filter☆18Nov 21, 2019Updated 6 years ago
- ☆15Aug 22, 2025Updated 5 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.☆34Oct 11, 2025Updated 4 months ago
- ☆15May 8, 2021Updated 4 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆21Mar 21, 2022Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Jul 17, 2020Updated 5 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- ICASSP 2021 accepted paper☆20May 20, 2021Updated 4 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- General tools for voice analysis.☆25Jul 30, 2025Updated 6 months ago
- ☆23Oct 17, 2024Updated last year
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆59Jan 24, 2024Updated 2 years ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Oct 15, 2024Updated last year
- ☆24Feb 28, 2023Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆146May 10, 2022Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆29Jan 16, 2026Updated 3 weeks ago
- Streaming Vocos☆29Jun 10, 2025Updated 8 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆56Jul 24, 2022Updated 3 years ago
- ☆23Jun 25, 2025Updated 7 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆439Aug 12, 2025Updated 6 months ago