Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
☆47May 7, 2024Updated 2 years ago
Alternatives and similar repositories for SpeakerRecognitionFromScratch
Users that are interested in SpeakerRecognitionFromScratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 说话人识别(声纹识别)算法的Python实现。包括GMM(已完成)、GMM-UBM、ivector、基于深度学习的声纹识别(self-attention已完成)。☆108Feb 21, 2023Updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15May 8, 2021Updated 5 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆451Aug 12, 2025Updated 9 months ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆213Jul 17, 2020Updated 5 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- ☆28Dec 14, 2022Updated 3 years ago
- Curriculum Vitae of Quan Wang☆15Dec 13, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple implementation for improving CosyVoice2 by GRPO method☆38May 5, 2026Updated last month
- ☆15Jul 15, 2019Updated 6 years ago
- ☆12Mar 11, 2025Updated last year
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 5 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…☆12Apr 28, 2025Updated last year
- ☆11Dec 28, 2023Updated 2 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆45Oct 24, 2020Updated 5 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆73Dec 23, 2025Updated 5 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆61Dec 1, 2024Updated last year
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144May 10, 2022Updated 4 years ago
- 大工计算机系大二学年资料☆11May 15, 2020Updated 6 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 9 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Example of application of genetic algorithm for evolution kart navigation.☆11Nov 21, 2019Updated 6 years ago
- CMake的实例程序☆13Sep 27, 2021Updated 4 years ago
- ☆31Aug 9, 2022Updated 3 years ago
- Prediction of sound event bounding boxes (SEBBs)☆35Aug 2, 2024Updated last year
- ☆15Apr 16, 2026Updated last month
- ☆15Apr 2, 2025Updated last year
- ☆43Feb 8, 2025Updated last year