iiscleap / NISP-Dataset
☆30 Updated 3 years ago
Alternatives and similar repositories for NISP-Dataset
Users who are interested in NISP-Dataset are comparing it to the repositories listed below
- Clustering-based methods for overlapping diarization ☆82 Updated 2 years ago
- A unified dataset of multilingual emotional human utterances ☆29 Updated 3 weeks ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆95 Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆68 Updated 4 years ago
- Implementation of audio degradation processes ☆105 Updated 10 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 3 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20… ☆49 Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆71 Updated 4 years ago
- ☆53 Updated 2 years ago
- PyTorch implementation of RPNSD ☆60 Updated last year
- ☆62 Updated last year
- A list of papers for child ASR ☆51 Updated last year
- ☆19 Updated 3 years ago
- ☆37 Updated 4 years ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.… ☆106 Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e… ☆63 Updated 6 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information" ☆45 Updated 6 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆61 Updated 5 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆78 Updated 3 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding ☆23 Updated last year
- Machine learning speaker characteristics ☆43 Updated this week
- Keras-based python framework to compute phonological posterior probabilities from audio files ☆46 Updated 3 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training ☆41 Updated 5 years ago
- Discriminative Training of VBx Diarization ☆27 Updated last year
- ☆66 Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification ☆21 Updated 2 years ago
- A PyTorch implementation of End-to-End Neural Diarization ☆109 Updated 2 years ago
- Balanced Error Rate for Speaker Diarization ☆33 Updated 2 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances ☆50 Updated 3 years ago
- Python toolkit for speech processing ☆72 Updated 3 weeks ago