Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris Smaragdis and Jonathan Le Roux
☆14Feb 15, 2023Updated 3 years ago
Alternatives and similar repositories for optimal_condition_training
Users who are interested in optimal_condition_training are comparing it to the libraries listed below.
- ☆14Jan 17, 2023Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…☆13Feb 25, 2023Updated 3 years ago
- ☆13Jul 3, 2024Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆17May 19, 2023Updated 2 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆58Apr 25, 2023Updated 2 years ago
- ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation☆14Mar 7, 2025Updated last year
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Official implementation for MGN☆20Dec 22, 2022Updated 3 years ago
- Code for the paper: Separate but together: Unsupervised Federated Learning for Speech Enhancement from non-iid data☆41Nov 1, 2021Updated 4 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago
- First steps in Machine Learning☆12Mar 18, 2015Updated 11 years ago
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆39Nov 20, 2024Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21☆17May 14, 2022Updated 3 years ago
- ☆30Jun 12, 2025Updated 10 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- ☆51May 16, 2021Updated 4 years ago
- Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"☆12Dec 4, 2025Updated 4 months ago
- ☆18Aug 16, 2025Updated 7 months ago
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- Weird autoencoder experiments☆24Jan 26, 2026Updated 2 months ago
- The open source code for LLM-Codec☆146Aug 18, 2024Updated last year
- A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input…☆49May 21, 2025Updated 10 months ago
- ☆30Apr 22, 2024Updated last year
- Tools to run experiments around large scale cover detection.☆28Sep 30, 2022Updated 3 years ago
- A Dataset for Cover Song Identification and Understanding☆65Feb 23, 2023Updated 3 years ago
- ☆11Feb 14, 2025Updated last year
- This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022☆145Oct 11, 2023Updated 2 years ago
- Arduino library for the Maxim DS1337 I2C RTC.☆11Aug 20, 2014Updated 11 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- ☆30Nov 5, 2023Updated 2 years ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 3 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆100Jul 24, 2024Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 4 years ago