biboamy / TVSM-dataset
☆78Updated 3 months ago
Alternatives and similar repositories for TVSM-dataset:
Users that are interested in TVSM-dataset are comparing it to the libraries listed below
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆91Updated 5 months ago
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆39Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆143Updated 7 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆39Updated last year
- Benchmark popular audio i/o packages☆140Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆101Updated 2 months ago
- ☆79Updated 7 months ago
- ☆43Updated 7 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated 2 years ago
- Self-supervised learning for fast pitch estimation☆199Updated last month
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆91Updated this week
- A DDSP-based neural voice synthesiser.☆112Updated 2 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆62Updated last year
- Expressive Anechoic Recordings of Speech (EARS)☆141Updated 6 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆43Updated 5 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆154Updated 2 years ago
- The open source code for SimpleSpeech series☆121Updated 3 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆44Updated 4 months ago
- A sequence-to-sequence voice conversion toolkit.☆92Updated 6 months ago
- ☆36Updated 3 months ago
- Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆79Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆64Updated last month
- ☆57Updated last year
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆78Updated 11 months ago
- Yin pitch estimator in PyTorch☆115Updated 2 years ago
- An open source platform for browser based speech and audio subjective quality tests.☆33Updated last year
- ☆183Updated 11 months ago
- ☆62Updated 9 months ago
- Clustering-based methods for overlapping diarization☆74Updated last year