Unofficial implementation of ECAPA-TDNN
☆30Feb 28, 2021Updated 5 years ago
Alternatives and similar repositories for ECAPA-TDNN
Users that are interested in ECAPA-TDNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆97Sep 15, 2021Updated 4 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- ☆26Nov 2, 2022Updated 3 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆105Sep 2, 2021Updated 4 years ago
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.☆13Mar 8, 2016Updated 10 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated 2 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 6 months ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆795Apr 11, 2024Updated last year
- Generalized Sentiment Classifier finetuned by KoELECTRA☆11Nov 28, 2024Updated last year
- Home power monitor using Spark Core☆11Oct 1, 2015Updated 10 years ago
- ☆11May 4, 2020Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆159Jan 9, 2023Updated 3 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- ☆94Apr 24, 2025Updated 11 months ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- UT-Sarulab MOS prediction system using SSL models☆297Apr 11, 2024Updated last year
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Aug 19, 2024Updated last year
- Training code for MaskGCT-T2S model.☆25Dec 14, 2024Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- ☆59Mar 28, 2025Updated last year
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆172May 20, 2025Updated 10 months ago
- A Demo of Mandarin/Chinese TTS frontend☆285Apr 18, 2022Updated 3 years ago
- Share some recent speaker recognition papers and their implementations.☆90Sep 26, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- Experiments with Kidde / Wink 433MHz wireless protocol☆15Jul 25, 2020Updated 5 years ago
- ☆50Aug 16, 2023Updated 2 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 3 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- PyTorch implementation of simplified neural source filter model (s-nsf)☆14Aug 4, 2021Updated 4 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆88Apr 23, 2019Updated 6 years ago