biboamy / TVSM-datasetLinks
☆86Updated 8 months ago
Alternatives and similar repositories for TVSM-dataset
Users that are interested in TVSM-dataset are comparing it to the libraries listed below
Sorting:
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆97Updated 11 months ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 8 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆157Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 3 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated last year
- ☆44Updated last year
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆79Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆87Updated 9 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 10 months ago
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆45Updated 2 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆47Updated 3 weeks ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆54Updated last year
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆100Updated 3 years ago
- SelfRemaster: SSL Speech Restoration☆89Updated last year
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆82Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆112Updated 4 months ago
- ☆67Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks☆64Updated 11 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆76Updated 6 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆101Updated 5 months ago
- Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features☆85Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆152Updated last week
- ☆57Updated 2 years ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆73Updated last week
- ☆139Updated 2 months ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆80Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆188Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆166Updated last year