Deep Speaker: an End-to-End Neural Speaker Embedding System.
☆941Apr 13, 2024Updated 2 years ago
Alternatives and similar repositories for deep-speaker
Users that are interested in deep-speaker are comparing it to the libraries listed below.
- Speaker embedding (verification and recognition) using PyTorch☆369Jul 24, 2020Updated 5 years ago
- Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)☆253Apr 27, 2020Updated 6 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆790Mar 3, 2020Updated 6 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wild☆372Mar 24, 2023Updated 3 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆598Jan 20, 2022Updated 4 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Oct 1, 2017Updated 8 years ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"☆367Oct 9, 2021Updated 4 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Nov 19, 2018Updated 7 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆406Mar 21, 2024Updated 2 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 7 years ago
- dVector-based speaker recognition in Keras☆89Nov 30, 2020Updated 5 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,240Apr 28, 2021Updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,872Jun 1, 2026Updated 2 weeks ago
- Speaker embedding (d-vector) trained with GE2E loss☆289Jan 8, 2024Updated 2 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆500Jul 1, 2021Updated 4 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆213Jul 17, 2020Updated 5 years ago
- Open Source Tools for Speaker Recognition☆637Aug 5, 2024Updated last year
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,588Sep 25, 2024Updated last year
- TristouNet: Triplet Loss for Speaker Turn Embedding☆122Jul 6, 2017Updated 8 years ago
- Share some recent speaker recognition papers and their implementations.☆89Sep 26, 2019Updated 6 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆402Feb 4, 2019Updated 7 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,051Jul 5, 2023Updated 2 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆207Dec 8, 2022Updated 3 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,643Apr 22, 2024Updated 2 years ago
- The Implementation of FastSpeech based on pytorch.☆883Jul 6, 2023Updated 2 years ago
- A library for speech data augmentation in time-domain☆689Aug 30, 2021Updated 4 years ago
- In defence of metric learning for speaker recognition☆1,169Apr 22, 2026Updated last month
- Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.☆552Sep 25, 2024Updated last year
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆118Nov 5, 2019Updated 6 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆31Jun 30, 2020Updated 5 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,128Oct 19, 2023Updated 2 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆102Apr 15, 2017Updated 9 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- ☆484Oct 29, 2020Updated 5 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆339Jul 6, 2023Updated 2 years ago
- List of speech synthesis papers.☆1,072Jul 24, 2023Updated 2 years ago