Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
☆47May 7, 2024Updated 2 years ago
Alternatives and similar repositories for SpeakerRecognitionFromScratch
Users that are interested in SpeakerRecognitionFromScratch are comparing it to the libraries listed below.
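- Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (completed), GMM-UBM, ivector, and deep-learning-based speaker recognition (self-attention completed).☆108Feb 21, 2023Updated 3 years ago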
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 4 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- ☆15May 8, 2021Updated 5 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆451Aug 12, 2025Updated 10 months ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆213Jul 17, 2020Updated 5 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- Megatts2 using HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- ☆27Dec 14, 2022Updated 3 years ago
- Curriculum Vitae of Quan Wang☆15Dec 13, 2025Updated 6 months ago
- A simple implementation for improving CosyVoice2 by GRPO method☆38May 5, 2026Updated last month
- ☆12Mar 11, 2025Updated last year
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 5 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- ☆11Dec 28, 2023Updated 2 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆73Dec 23, 2025Updated 6 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆61Dec 1, 2024Updated last year
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- Python implementation of sinewave speech, as a command-line tool☆14May 30, 2020Updated 6 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144May 10, 2022Updated 4 years ago
- Sophomore-year course materials for the Computer Science department at Dalian University of Technology (大工)☆10May 15, 2020Updated 6 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆37Aug 30, 2025Updated 10 months ago
- Example of applying a genetic algorithm to evolve kart navigation.☆11Nov 21, 2019Updated 6 years ago
- Example programs for CMake☆13Sep 27, 2021Updated 4 years ago
- ☆31Aug 9, 2022Updated 3 years ago
- Prediction of sound event bounding boxes (SEBBs)☆35Aug 2, 2024Updated last year
- ☆15Apr 16, 2026Updated 2 months ago
- ☆16Apr 2, 2025Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- ☆43Feb 8, 2025Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago