Rumeysakeskin / Speaker-Verification
Language-independent speaker verification: confirming a person's identity from the characteristics of their voice, using NVIDIA NeMo models (ECAPA-TDNN, SpeakerNet, TitaNet-L).
☆37 · Updated 2 years ago
Alternatives and similar repositories for Speaker-Verification
Users who are interested in Speaker-Verification compare it to the repositories listed below.
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan ☆60 · Updated last year
- Download speech datasets (English and non-English) for Automatic Speech Recognition ☆15 · Updated 2 years ago
- InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks ☆20 · Updated 2 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆21 · Updated 3 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition. ☆46 · Updated 4 years ago
- ☆62 · Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆53 · Updated 2 years ago
- Automatic image captioning on Android-based mobile application with CNN and multi-layer GRU encoder-decoder model ☆14 · Updated 2 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice" ☆99 · Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 · Updated 2 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2 ☆127 · Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 · Updated 4 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆89 · Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆89 · Updated last year
- Multi-Speaker PyTorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ☆98 · Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107 · Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆65 · Updated 2 years ago
- Wav2vec 2.0 Self-Supervised Pretraining ☆52 · Updated 8 months ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… ☆80 · Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆68 · Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Updated 3 years ago
- ☆200 · Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆82 · Updated 2 years ago
- Online streaming speaker change detection model in Pytorch ☆42 · Updated 2 years ago
- Fine-tune the LLM part of the spark-tts model ☆111 · Updated 6 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆53 · Updated 4 months ago
- Speaker diarization for the Hindi language. ☆50 · Updated 4 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆261 · Updated 9 months ago
- ☆67 · Updated 4 months ago
- ☆21 · Updated 4 years ago