Official implementation for FlowSep
☆70Jan 2, 2025Updated last year
Alternatives and similar repositories for FlowSep
Users that are interested in FlowSep are comparing it to the libraries listed below
Sorting:
- Query-conditioned target sound extraction model☆30Mar 25, 2025Updated 11 months ago
- ☆43Jan 13, 2025Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆44Oct 30, 2025Updated 4 months ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆51Sep 20, 2025Updated 6 months ago
- ☆207Dec 5, 2024Updated last year
- ☆44Apr 2, 2025Updated 11 months ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆131Sep 2, 2025Updated 6 months ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆33Nov 9, 2025Updated 4 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆50Nov 11, 2025Updated 4 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆46May 16, 2025Updated 10 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- Landing Page for Divide and Remaster v3☆25Jul 29, 2025Updated 7 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆197Dec 13, 2024Updated last year
- ☆33Dec 23, 2025Updated 2 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆122Aug 8, 2025Updated 7 months ago
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation☆24Nov 4, 2025Updated 4 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆108Jan 17, 2025Updated last year
- This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022☆145Oct 11, 2023Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- ☆117Feb 26, 2026Updated 3 weeks ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated last year
- ☆21Jul 15, 2024Updated last year
- Official code of SenSE.☆76Oct 30, 2025Updated 4 months ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 10 months ago
- ☆38Jul 4, 2024Updated last year
- Official code for SongEcho☆52Mar 3, 2026Updated 2 weeks ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 8 months ago
- ☆151Apr 25, 2025Updated 10 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆211Jun 25, 2024Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆47Nov 19, 2024Updated last year
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆64Jan 16, 2025Updated last year