Speech-Interaction-Technology-Aalto-U / itsp
Introduction to Speech Processing
☆79Updated 3 months ago
Alternatives and similar repositories for itsp:
Users who are interested in itsp are comparing it to the repositories listed below
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆129Updated last month
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆127Updated 2 years ago
- Predicts the level of noise and reverberation in your audio files☆143Updated 7 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆70Updated 9 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆141Updated 6 months ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆162Updated last year
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆130Updated 11 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆60Updated 2 years ago
- A speaker embedding network in PyTorch that is very quick to set up and use for whatever purposes.☆88Updated last year
- Reference-aware automatic speech evaluation toolkit☆139Updated last month
- Libri-CSS: dataset and evaluation pipeline☆141Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆199Updated 9 months ago
- Official repository of NeXt-TDNN for speaker verification☆65Updated 3 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆144Updated 2 years ago
- The VoxTube dataset official repository☆64Updated 11 months ago
- Clustering-based methods for overlapping diarization☆74Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆100Updated 2 months ago
- This GitHub repo is for NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆95Updated last year
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆249Updated 8 months ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- An implementation of audio source separation tools.☆78Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆159Updated last month
- Spot the conversation: speaker diarisation in the wild☆132Updated 2 years ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆202Updated 4 months ago
- ☆109Updated 3 years ago
- This package aims at simplifying the download of the AudioSet dataset.☆45Updated last year
- ☆183Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆47Updated 2 months ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆89Updated last year
- Big Impulse Response Dataset☆141Updated 2 years ago