This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆145Oct 11, 2023Updated 2 years ago
Alternatives and similar repositories for LASS
Users that are interested in LASS are comparing it to the libraries listed below
Sorting:
- Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.☆45Oct 8, 2021Updated 4 years ago
- Code for "Simple Pooling Front-ends for Efficient Audio Calssification", ICASSP 2023☆57Mar 3, 2023Updated 2 years ago
- Visually-Aware Audio Captioning☆43Mar 3, 2023Updated 2 years ago
- ☆19Sep 2, 2022Updated 3 years ago
- Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021☆69Sep 3, 2021Updated 4 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆210Jul 14, 2022Updated 3 years ago
- Query-conditioned target sound extraction model☆30Mar 25, 2025Updated 11 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆44Jul 10, 2024Updated last year
- Official implementation for FlowSep☆70Jan 2, 2025Updated last year
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆80Jul 1, 2022Updated 3 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- ☆10Apr 12, 2023Updated 2 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆11Oct 25, 2022Updated 3 years ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.☆257Jul 25, 2024Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- Unofficial PyTorch implementation of Music Source Separation with Band-split RNN☆187Jun 10, 2024Updated last year
- ☆43Feb 21, 2023Updated 3 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- A deep neural network architecture for low-latency audio processing☆323Aug 15, 2023Updated 2 years ago
- ☆23Aug 30, 2022Updated 3 years ago
- ☆40Apr 2, 2025Updated 10 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆196Dec 13, 2024Updated last year
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- Source code for the paper 'Audio Captioning Transformer'☆57Jan 18, 2022Updated 4 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆103Mar 19, 2024Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆91Jun 9, 2022Updated 3 years ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021☆231Feb 27, 2023Updated 3 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Apr 27, 2023Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Sep 1, 2023Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Jun 17, 2022Updated 3 years ago
- ☆140Sep 8, 2025Updated 5 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Nov 19, 2024Updated last year
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆472Sep 18, 2025Updated 5 months ago