Joovvhan/ECAPA-TDNN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Joovvhan/ECAPA-TDNN)

Joovvhan / ECAPA-TDNN

Unofficial implementation of ECAPA-TDNN

☆30

Alternatives and similar repositories for ECAPA-TDNN

Users that are interested in ECAPA-TDNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ranchlai / speaker-verification
View on GitHub
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆97Sep 15, 2021Updated 4 years ago
ml-for-speech / speechtoolkit
View on GitHub
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆21Jan 10, 2025Updated last year
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
yangdongchao / ALMTokenizer2
View on GitHub
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆45Sep 5, 2025Updated 10 months ago
msh9184 / ska-tdnn
View on GitHub
☆26Nov 2, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
lawlict / ECAPA-TDNN
View on GitHub
☆106Sep 2, 2021Updated 4 years ago
danpovey / openfst
View on GitHub
Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.
☆13Mar 8, 2016Updated 10 years ago
clovaai / voxceleb_trainer
View on GitHub
In defence of metric learning for speaker recognition
☆1,170Apr 22, 2026Updated 3 months ago
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
TaoRuijie / ECAPA-TDNN
View on GitHub
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
☆823Apr 11, 2024Updated 2 years ago
jaehyeongAN / KoELECTRA-finetuned-sentiment-analysis
View on GitHub
Generalized Sentiment Classifier finetuned by KoELECTRA
☆11Nov 28, 2024Updated last year
zyzisyz / mfa_conformer
View on GitHub
☆160Jan 9, 2023Updated 3 years ago
tstafylakis / Speaker-Embeddings-Correlation-Pooling
View on GitHub
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
☆11Sep 20, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
BUTSpeechFIT / EEND
View on GitHub
☆95Apr 24, 2025Updated last year
mycrazycracy / speaker-embedding-with-phonetic-information
View on GitHub
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45Jul 10, 2019Updated 7 years ago
petronny / g2p
View on GitHub
Pre-trained grapheme-to-phoneme (G2P) models
☆26Jul 27, 2021Updated 5 years ago
feerci / feerci
View on GitHub
FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates
☆12Mar 13, 2024Updated 2 years ago
Livefull / SphereDiar
View on GitHub
☆11May 4, 2020Updated 6 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
sarulab-speech / UTMOS22
View on GitHub
UT-Sarulab MOS prediction system using SSL models
☆309Apr 11, 2024Updated 2 years ago
nene1212 / MaskGCT-Training
View on GitHub
Training code for MaskGCT-T2S model.
☆25Dec 14, 2024Updated last year
ga642381 / AudioCodec-Hub
View on GitHub
AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
☆25Sep 26, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
BUTSpeechFIT / EEND_dataprep
View on GitHub
☆59Mar 28, 2025Updated last year
kyegomez / SoundStream
View on GitHub
Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆13Jan 27, 2025Updated last year
GT-KIM / specmix
View on GitHub
This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…
☆11Sep 27, 2022Updated 3 years ago
lijuncheng16 / AudioTaggingDoneRight
View on GitHub
experiments about AudioSet
☆43Jul 22, 2023Updated 3 years ago
bjfu-ai-institute / speaker-recognition-papers
View on GitHub
Share some recent speaker recognition papers and their implementations.
☆89Sep 26, 2019Updated 6 years ago
Choddeok / EmoSphere-TTS
View on GitHub
[INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …
☆182Jul 16, 2026Updated last week
makerjackie / MTTS
View on GitHub
A Demo of Mandarin/Chinese TTS frontend
☆284Apr 18, 2022Updated 4 years ago
IanHarvey / spark-monitor
View on GitHub
Home power monitor using Spark Core
☆11Oct 1, 2015Updated 10 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
PhonemeHallucinator / Phoneme_Hallucinator
View on GitHub
☆48Aug 16, 2023Updated 2 years ago
Akella17 / speaker-embedding
View on GitHub
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
wentaozhu / speechnas
View on GitHub
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30Mar 24, 2023Updated 3 years ago
immopoly / android
View on GitHub
The Android client app
☆15Mar 6, 2013Updated 13 years ago
Takaaki-Saeki / simplified_neural_source_filter
View on GitHub
PyTorch implementation of simplified neural source filter model (s-nsf)
☆14Aug 4, 2021Updated 4 years ago
cvqluu / GE2E-Loss
View on GitHub
Pytorch implementation of Generalized End-to-End Loss for speaker verification
☆88Apr 23, 2019Updated 7 years ago