Fraunhofer-IIS / ODAQ
☆40Updated 6 months ago
Alternatives and similar repositories for ODAQ:
Users that are interested in ODAQ are comparing it to the libraries listed below
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆30Updated 4 months ago
- Landing Page for All Things Source Separation☆27Updated 5 months ago
- ☆43Updated 10 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆70Updated 3 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆40Updated 5 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated last month
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆46Updated 6 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆43Updated last month
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 9 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆66Updated last month
- PAM is a no-reference audio quality metric for audio generation tasks☆58Updated 9 months ago
- Prediction of sound event bounding boxes (SEBBs)☆26Updated 8 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆38Updated 8 months ago
- ☆38Updated 3 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆47Updated last month
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆28Updated 2 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆60Updated 2 years ago
- Open source code for the paper 'Music Source Separation with Generative Flow'☆23Updated 2 years ago
- Landing Page for Divide and Remaster v3☆17Updated 9 months ago
- ☆20Updated 3 weeks ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆34Updated 4 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆86Updated 4 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆25Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆36Updated 5 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆93Updated 7 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 8 months ago
- This is the official implementation of reverberant speech to room impulse response estimator☆30Updated 8 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆34Updated 7 months ago
- Official implementation for FlowSep☆42Updated 3 months ago