Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
☆47May 7, 2024Updated last year
Alternatives and similar repositories for SpeakerRecognitionFromScratch
Users that are interested in SpeakerRecognitionFromScratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 说话人识别(声纹识别)算法的Python实现。包括GMM(已完成)、GMM-UBM、ivector、基于深度学习的声纹识别(self-attention已完成)。☆107Feb 21, 2023Updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15May 8, 2021Updated 4 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆447Aug 12, 2025Updated 8 months ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Jul 17, 2020Updated 5 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- GeoKrige is a Python package designed for spatial interpolation using Kriging Methods. While primarily tailored for geospatial analysis,…☆12Mar 26, 2024Updated 2 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- ☆28Dec 14, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Curriculum Vitae of Quan Wang☆15Dec 13, 2025Updated 4 months ago
- A simple implementation for improving CosyVoice2 by GRPO method☆37Oct 17, 2025Updated 6 months ago
- ☆15Jul 15, 2019Updated 6 years ago
- ☆12Mar 11, 2025Updated last year
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- A git repo showcasing RAG Techniques for building Naive to Advance RAG solutions☆13Feb 16, 2025Updated last year
- ☆11Dec 28, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…☆12Apr 28, 2025Updated 11 months ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 3 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆60Dec 1, 2024Updated last year
- 本仓库致力于收集和分享与**EEG(脑电图)**相关的研究论文。目标是创建一个协作学习的平台,让所有对EEG技术、应用和创新感兴趣的人都能在这里学习、分享和共同进步。无论你是学生 、研究人员还是EEG爱好者,都可以在这里找到最新的论文、分享自己的见解,并贡献自己的知识。☆20Mar 7, 2025Updated last year
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- Boiling Insights - From raw S3 data to charts in seconds☆23Dec 12, 2024Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145May 10, 2022Updated 3 years ago
- Agentic Github Issues Retrieval on Kubernetes☆27Aug 5, 2025Updated 8 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- Example of application of genetic algorithm for evolution kart navigation.☆11Nov 21, 2019Updated 6 years ago
- CMake的实例程序☆12Sep 27, 2021Updated 4 years ago
- ☆31Aug 9, 2022Updated 3 years ago