YuanGongND / vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP 2022).
☆127 Updated 2 years ago
Alternatives and similar repositories for vocalsound:
Users who are interested in vocalsound are comparing it to the libraries listed below.
- A library built for easier audio self-supervised training and downstream task evaluation ☆110 Updated 4 months ago
- Audio Captioning datasets for PyTorch. ☆111 Updated 2 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆144 Updated 2 years ago
- This repository surveys papers focusing on Prompting and Adapters for Speech Processing. ☆107 Updated last year
- Source code for Consistent ensemble distillation for audio tagging ☆21 Updated 6 months ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch. ☆66 Updated last year
- Implementation of BEST-RQ, a model for self-supervised learning of speech signals using a random projection quantizer, in PyTorch. ☆107 Updated last year
- ☆81 Updated last year
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances ☆49 Updated 2 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement" ☆115 Updated 2 years ago
- PyTorch implementation of subband decomposition ☆91 Updated 2 years ago
- Libri-CSS: dataset and evaluation pipeline ☆141 Updated 2 years ago
- A simple package for Guided source separation (GSS) ☆112 Updated 7 months ago
- Clustering-based methods for overlapping diarization ☆74 Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆60 Updated 2 years ago
- ☆44 Updated last year
- Learning differentiable temporal resolution on time-series data. ☆35 Updated 2 years ago
- A PyTorch implementation of End-to-End Neural Diarization ☆101 Updated last year
- ☆30 Updated last year
- ☆62 Updated 4 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆81 Updated 5 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations ☆100 Updated 2 months ago
- Repo associated to the DESED dataset, download and creation of data ☆131 Updated 6 months ago
- Domestic environment sound event detection task ☆136 Updated 7 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆66 Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆129 Updated last month
- Streaming Audiotransformers for online Audio tagging ☆43 Updated 7 months ago
- Official repository of our paper: https://arxiv.org/abs/2010.15366 ☆61 Updated 3 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain. ☆89 Updated last year
- ConMamba for Automatic Speech Recognition ☆53 Updated 5 months ago