biboamy / TVSM-datasetLinks
☆93Updated last year
Alternatives and similar repositories for TVSM-dataset
Users that are interested in TVSM-dataset are comparing it to the libraries listed below
Sorting:
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆100Updated last year
- SelfRemaster: SSL Speech Restoration☆93Updated last year
- A collection of audio signals accompanied by corresponding subjective scores of perceived quality. Everything under permissive licenses.☆45Updated 3 weeks ago
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆87Updated last year
- Expressive Anechoic Recordings of Speech (EARS)☆202Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆158Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- ☆65Updated 6 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Updated 3 years ago
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆48Updated last year
- This code is to run the WARP-Q speech quality metric.☆35Updated last year
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆47Updated 8 months ago
- A sequence-to-sequence voice conversion toolkit.☆106Updated last year
- ☆45Updated last year
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆151Updated 6 months ago
- Predicts the level of noise and reverberation on your audiofiles☆174Updated 6 months ago
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆79Updated 3 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆88Updated 8 months ago
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆31Updated 4 years ago
- ☆65Updated 2 years ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆113Updated 6 months ago
- ☆80Updated 4 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆94Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- Clustering-based methods for overlapping diarization☆82Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆101Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆121Updated 10 months ago