Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
Alternatives and similar repositories for dcase2024_task9_baseline
Users that are interested in dcase2024_task9_baseline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆43Oct 13, 2023Updated 2 years ago
- Official implementation for FlowSep☆75Jan 2, 2025Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆84May 21, 2025Updated last year
- ☆12Nov 7, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆118Jan 28, 2026Updated 4 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- Discogs-VI dataset and code☆21Dec 13, 2024Updated last year
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆18Nov 19, 2024Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023)☆126Mar 15, 2024Updated 2 years ago
- Prediction of sound event bounding boxes (SEBBs)☆35Aug 2, 2024Updated last year
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆54Apr 28, 2022Updated 4 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆77Oct 8, 2025Updated 8 months ago
- ☆31Apr 22, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆23Mar 19, 2025Updated last year
- ☆12Mar 11, 2025Updated last year
- ☆26Mar 20, 2024Updated 2 years ago
- ☆221Dec 5, 2024Updated last year
- ☆68Aug 16, 2023Updated 2 years ago
- ☆88Feb 24, 2026Updated 3 months ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆18May 27, 2024Updated 2 years ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆204Dec 13, 2024Updated last year
- ☆28Mar 28, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 4 years ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆217Sep 19, 2024Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Jun 23, 2022Updated 3 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆23Jul 10, 2024Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆106Mar 19, 2024Updated 2 years ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- ☆87May 21, 2023Updated 3 years ago
- ☆33Dec 23, 2025Updated 5 months ago
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Jan 2, 2025Updated last year
- Query-conditioned target sound extraction model☆30Mar 25, 2025Updated last year
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆254Mar 7, 2025Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆17May 19, 2023Updated 3 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆90May 21, 2025Updated last year
- Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval…☆21Feb 1, 2023Updated 3 years ago