Rumeysakeskin / Speaker-VerificationLinks
Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, SpeakerNet, TitaNet-L).
☆36Updated last year
Alternatives and similar repositories for Speaker-Verification
Users that are interested in Speaker-Verification are comparing it to the libraries listed below
Sorting:
- Download speech datasets (English and non-English) for Automatic Speech Recognition☆15Updated 2 years ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆57Updated last year
- InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks☆20Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆126Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated last month
- ☆58Updated last year
- In this repository, we provide a neural model based on BERT and BiLSTM neural networks, which can recognize the Kasreh Ezafeh (Genitive C…☆7Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆40Updated 2 years ago
- Automatic image captioning on Android-based mobile application with CNN and multi-layer GRU encoder-decoder model☆14Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆80Updated 11 months ago
- finetune llm part for spark-tts model☆99Updated 3 months ago
- Online streaming speaker change detection model in Pytorch☆41Updated 2 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆54Updated 5 months ago
- ☆49Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆61Updated 2 years ago
- ☆26Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- Diarization Metric in One: current support DER, JER, CDER, SER, and BER☆9Updated 2 years ago
- Onnx compatible styletts2 code☆12Updated last month
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 6 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆41Updated 11 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago