biboamy / TVSM-dataset
☆78Updated 4 months ago
Alternatives and similar repositories for TVSM-dataset:
Users that are interested in TVSM-dataset are comparing it to the libraries listed below
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆93Updated 6 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 6 months ago
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆79Updated last year
- Clustering-based methods for overlapping diarization☆75Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆131Updated 2 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆45Updated last year
- Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆80Updated last year
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆43Updated last year
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆48Updated 5 years ago
- ☆36Updated 4 months ago
- A simple package for Guided source separation (GSS)☆114Updated 9 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- Manage audio and video datasets☆27Updated last week
- ☆47Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆43Updated 6 months ago
- SDX23 startkit for the Demucs baselines.☆26Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆34Updated 5 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆148Updated 7 months ago
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 8 months ago
- Benchmark popular audio i/o packages☆139Updated last year
- This code is to run the WARP-Q speech quality metric.☆34Updated 4 months ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆85Updated 2 years ago
- ☆80Updated 8 months ago
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆80Updated 2 years ago
- ☆80Updated last year
- ☆43Updated 8 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆84Updated 11 months ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated 8 months ago