iiscleap / NISP-Dataset
☆27Updated 2 years ago
Alternatives and similar repositories for NISP-Dataset:
Users that are interested in NISP-Dataset are comparing it to the libraries listed below
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆13Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆75Updated 2 years ago
- ☆52Updated last year
- MultiSV: scripts for data preparation☆27Updated last week
- Discriminative Training of VBx Diarization☆22Updated 4 months ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆46Updated last month
- ☆16Updated 2 years ago
- Baseline kaldi script for UA-SPEECH corpus☆29Updated 3 months ago
- Clustering-based methods for overlapping diarization☆74Updated last year
- ☆32Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆28Updated last year
- A list of papers for child ASR☆35Updated 3 months ago
- Script to perform statistical significance test between ASR hypotheses.☆21Updated 7 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Discriminative Condition-Aware PLDA☆43Updated 6 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- ☆51Updated 8 months ago
- A unified dataset of multilingual emotional human utterances☆24Updated 3 years ago
- PyTorch implementation of RPNSD☆60Updated 7 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆25Updated 6 months ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Implementation of audio degradation processes☆101Updated 9 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆50Updated last month
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆97Updated 4 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago