Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris Smaragdis and Jonathan Le Roux
☆14Feb 15, 2023Updated 3 years ago
Alternatives and similar repositories for optimal_condition_training
Users that are interested in optimal_condition_training are comparing it to the libraries listed below
Sorting:
- ☆14Jan 17, 2023Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…☆13Feb 25, 2023Updated 3 years ago
- ☆13Jul 3, 2024Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆57Apr 25, 2023Updated 2 years ago
- ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation☆14Mar 7, 2025Updated last year
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Official implementation for MGN☆20Dec 22, 2022Updated 3 years ago
- Code for the paper: Separate but togerher: Unsupervised Federated Learning for Speech Enhancement from non-iid data☆42Nov 1, 2021Updated 4 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago
- First steps in Machine Learning☆12Mar 18, 2015Updated 11 years ago
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆39Nov 20, 2024Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16May 14, 2022Updated 3 years ago
- ☆30Jun 12, 2025Updated 9 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"☆12Dec 4, 2025Updated 3 months ago
- ☆51May 16, 2021Updated 4 years ago
- ☆18Aug 16, 2025Updated 7 months ago
- Weird autoencoder experiments☆24Jan 26, 2026Updated last month
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input…☆47May 21, 2025Updated 10 months ago
- Tools to run experiments around large scale cover detection.☆28Sep 30, 2022Updated 3 years ago
- ☆31Apr 22, 2024Updated last year
- A Dataset for Cover Song Identification and Understanding☆64Feb 23, 2023Updated 3 years ago
- ☆11Feb 14, 2025Updated last year
- Arduino library for the Maxim DS1337 I2C RTC.☆11Aug 20, 2014Updated 11 years ago
- This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022☆145Oct 11, 2023Updated 2 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- ☆30Nov 5, 2023Updated 2 years ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 3 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago