aldragan0 / voice-recognitionLinks
Voice-based gender, age and language recognition.
☆43Updated 6 years ago
Alternatives and similar repositories for voice-recognition
Users that are interested in voice-recognition are comparing it to the libraries listed below
Sorting:
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Updated 4 years ago
- ☆92Updated last year
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆108Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆61Updated 4 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆127Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆48Updated 3 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆127Updated 3 years ago
- ☆30Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆69Updated 3 years ago
- Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.☆29Updated 5 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 2 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆217Updated 2 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.☆50Updated 4 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆31Updated last year
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 8 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆50Updated 6 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆94Updated 2 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆108Updated 3 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆22Updated last year
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Updated 6 years ago
- ☆121Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- SEAME corpus two develop set☆41Updated 5 years ago