iiscleap / NISP-DatasetLinks
☆30Updated 3 years ago
Alternatives and similar repositories for NISP-Dataset
Users that are interested in NISP-Dataset are comparing it to the libraries listed below
Sorting:
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 3 years ago
- Clustering-based methods for overlapping diarization☆82Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Updated 4 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆93Updated last year
- A unified dataset of multilingual emotional human utterances☆29Updated 4 years ago
- Implementation of audio degradation processes☆105Updated 10 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- ☆54Updated 2 years ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 8 years ago
- Python toolkit for speech processing☆72Updated 3 weeks ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆45Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Updated last year
- Discriminative Condition-Aware PLDA☆44Updated last year
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆111Updated 2 years ago
- ☆19Updated 3 years ago
- ☆62Updated last year
- PyTorch implementation of RPNSD☆60Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- A list of papers for child ASR☆50Updated last year
- MultiSV: scripts for data preparation☆28Updated 11 months ago
- The VoxTube dataset official repository☆71Updated last year
- A simple package for Guided source separation (GSS)☆132Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Updated 11 months ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago
- Keyword spotting and forced alignment in any language☆82Updated 4 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Updated 3 years ago