Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris Smaragdis and Jonathan Le Roux
☆14Feb 15, 2023Updated 3 years ago
Alternatives and similar repositories for optimal_condition_training
Users that are interested in optimal_condition_training are comparing it to the libraries listed below
- ☆14Jan 17, 2023Updated 3 years ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation☆14Mar 7, 2025Updated 11 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- The source code for the paper CrossSinger (ASRU 2023)☆18Oct 12, 2023Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21☆16May 14, 2022Updated 3 years ago
- Official implementation for MGN☆20Dec 22, 2022Updated 3 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- ☆32Apr 22, 2024Updated last year
- ☆24Feb 28, 2023Updated 3 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆56Apr 25, 2023Updated 2 years ago
- Official Repository for "Music Source Restoration"☆32Jun 1, 2025Updated 9 months ago
- A Dataset for Cover Song Identification and Understanding☆64Feb 23, 2023Updated 3 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Tools to run experiments around large scale cover detection.☆28Sep 30, 2022Updated 3 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- ☆30Nov 5, 2023Updated 2 years ago
- Code for the paper: Separate but together: Unsupervised Federated Learning for Speech Enhancement from non-iid data☆42Nov 1, 2021Updated 4 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆39Nov 20, 2024Updated last year
- [ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice☆28Dec 8, 2022Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 2 years ago
- A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input…☆46May 21, 2025Updated 9 months ago
- ☆30Jun 12, 2025Updated 8 months ago
- [NeurIPS 2025] Separate Anything in Audio with Zero Training☆56Nov 3, 2025Updated 3 months ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 9 months ago
- This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022☆145Oct 11, 2023Updated 2 years ago
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- misc programming languages☆11Jan 10, 2023Updated 3 years ago
- [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny 300M model!☆87Jan 29, 2026Updated last month
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆89Jun 12, 2023Updated 2 years ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- The Ecoacoustic Dataset from Arctic North Slope Alaska☆11May 29, 2025Updated 9 months ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆22Feb 13, 2026Updated 2 weeks ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- (Interspeech 2025, official code) Speech enhancement based on cascaded two flows☆16Sep 1, 2025Updated 6 months ago