biboamy / TVSM-dataset
☆78Updated last month
Related projects ⓘ
Alternatives and complementary repositories for TVSM-dataset
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆88Updated 3 months ago
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆77Updated 9 months ago
- A sequence-to-sequence voice conversion toolkit.☆86Updated 4 months ago
- This code is to run the WARP-Q speech quality metric.☆34Updated last month
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Predicts the level of noise and reverberation on your audiofiles☆138Updated 6 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆38Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆84Updated last month
- This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purp…☆62Updated 4 years ago
- Various speech datasets made available to the public☆99Updated last month
- ☆40Updated 5 months ago
- ☆77Updated 6 months ago
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆36Updated last year
- AudioBench: A Universal Benchmark for Audio Large Language Models☆93Updated last week
- ☆27Updated 7 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated last month
- Inference code for PaSST, using the HEAR API.☆29Updated 10 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆28Updated 3 months ago
- ☆49Updated 9 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆81Updated 8 months ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆85Updated 2 years ago
- A simple package for Guided source separation (GSS)☆107Updated 6 months ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆83Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆42Updated 3 months ago
- Reference-aware automatic speech evaluation toolkit☆109Updated 9 months ago