Rumeysakeskin / Speaker-VerificationLinks
Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, SpeakerNet, TitaNet-L).
☆40Updated 2 years ago
Alternatives and similar repositories for Speaker-Verification
Users that are interested in Speaker-Verification are comparing it to the libraries listed below
Sorting:
- InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks☆20Updated 3 years ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆62Updated 2 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- Download speech datasets (English and non-English) for Automatic Speech Recognition☆15Updated 2 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆127Updated 2 years ago
- VoxLingua107 recipe for SpeechBrain☆13Updated 4 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆67Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 6 months ago
- Automatic image captioning on Android-based mobile application with CNN and multi-layer GRU encoder-decoder model☆14Updated 3 years ago
- ☆22Updated 4 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Online streaming speaker change detection model in Pytorch☆43Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆105Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆91Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 3 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆67Updated 3 years ago
- ☆13Updated 4 years ago
- Clustering-based methods for overlapping diarization☆82Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆58Updated 4 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆37Updated last year
- ☆35Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Updated 10 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Updated 5 years ago
- ☆25Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Tunable pipelines☆40Updated 3 months ago
- ☆93Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year