Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
Alternatives and similar repositories for dcase2024_task9_baseline
Users that are interested in dcase2024_task9_baseline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- Official implementation for FlowSep☆74Jan 2, 2025Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆83May 21, 2025Updated 10 months ago
- ☆12Nov 7, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆115Jan 28, 2026Updated 2 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- Discogs-VI dataset and code☆20Dec 13, 2024Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023)☆126Mar 15, 2024Updated 2 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆18Nov 19, 2024Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆54Apr 28, 2022Updated 3 years ago
- ☆30Apr 22, 2024Updated last year
- ☆23Mar 19, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12Mar 11, 2025Updated last year
- ☆212Dec 5, 2024Updated last year
- ☆26Mar 20, 2024Updated 2 years ago
- ☆68Aug 16, 2023Updated 2 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆16May 27, 2024Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆200Dec 13, 2024Updated last year
- ☆28Mar 28, 2024Updated 2 years ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 3 years ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆214Sep 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Boosting Self-Supervised Embeddings for Speech Enhancement