iiscleap / NISP-Dataset
☆30 Updated 3 years ago
Alternatives and similar repositories for NISP-Dataset
Users that are interested in NISP-Dataset are comparing it to the libraries listed below
- Clustering-based methods for overlapping diarization ☆82 Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20… ☆49 Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆68 Updated 4 years ago
- A unified dataset of multilingual emotional human utterances ☆29 Updated 3 years ago
- PyTorch implementation of RPNSD ☆60 Updated last year
- ☆54 Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 3 years ago
- ☆19 Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆71 Updated 4 years ago
- Balanced Error Rate for Speaker Diarization ☆33 Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆61 Updated 5 years ago
- Implementation of audio degradation processes ☆105 Updated 10 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆92 Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆77 Updated 3 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆148 Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. ☆14 Updated 3 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding ☆23 Updated last year
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e… ☆64 Updated 6 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆63 Updated 4 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes. ☆90 Updated 8 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). ☆156 Updated 3 years ago
- RASTA-PLP and MFCC tool based on rasta-mat ☆33 Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆26 Updated 3 years ago
- fast SpecAugmentation code with numpy and scipy ☆31 Updated 6 years ago
- Discriminative Training of VBx Diarization ☆26 Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆61 Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification ☆21 Updated 2 years ago
- A list of papers for child ASR ☆50 Updated last year
- ☆61 Updated last year
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances ☆50 Updated 3 years ago