qiuqiangkong / audio_flow
☆116 · Jan 26, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for audio_flow
Users who are interested in audio_flow are comparing it to the libraries listed below.
- ☆56 · Jul 13, 2025 · Updated 7 months ago
- ☆29 · Jul 4, 2025 · Updated 7 months ago
- ☆21 · Apr 24, 2025 · Updated 9 months ago
- A simple library for Fréchet Audio Distance (FAD) calculation ☆246 · Aug 22, 2025 · Updated 5 months ago
- ☆55 · Jan 25, 2026 · Updated 3 weeks ago
- ☆37 · Jul 4, 2024 · Updated last year
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows ☆123 · Sep 2, 2025 · Updated 5 months ago
- Audio-FLAN ☆160 · Sep 23, 2025 · Updated 4 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆48 · Jan 19, 2026 · Updated 3 weeks ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers ☆125 · Mar 20, 2025 · Updated 10 months ago
- ☆13 · Mar 11, 2025 · Updated 11 months ago
- OpenFLAM: Framewise Language Audio Model ☆88 · Jan 14, 2026 · Updated last month
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model ☆289 · Oct 12, 2025 · Updated 4 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆51 · May 1, 2025 · Updated 9 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆43 · Jun 13, 2024 · Updated last year
- (ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement ☆88 · Jul 23, 2025 · Updated 6 months ago
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization". ☆308 · Aug 4, 2025 · Updated 6 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching ☆121 · Mar 27, 2025 · Updated 10 months ago
- The official implementation of TokenSynth (ICASSP 2025) ☆78 · Oct 27, 2025 · Updated 3 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline ☆195 · Dec 13, 2024 · Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching ☆45 · Feb 9, 2025 · Updated last year
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024] ☆58 · Nov 10, 2025 · Updated 3 months ago
- Unified automatic quality assessment for speech, music, and sound. ☆671 · Jun 5, 2025 · Updated 8 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆86 · Dec 20, 2024 · Updated last year
- [NeurIPS 2025] Separate Anything in Audio with Zero Training ☆53 · Nov 3, 2025 · Updated 3 months ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding. ☆416 · Sep 15, 2025 · Updated 5 months ago
- Text-To-Speech for NotebookLM ☆37 · Jul 20, 2025 · Updated 6 months ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation ☆13 · Mar 29, 2025 · Updated 10 months ago
- ☆11 · Nov 7, 2024 · Updated last year
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space. ☆245 · Mar 7, 2025 · Updated 11 months ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation ☆28 · Dec 19, 2024 · Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆94 · Jun 12, 2025 · Updated 8 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆91 · Jul 23, 2025 · Updated 6 months ago
- ☆70 · Jan 25, 2025 · Updated last year
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective". ☆63 · Nov 5, 2025 · Updated 3 months ago
- ☆155 · Nov 22, 2024 · Updated last year
- ☆52 · Jul 16, 2025 · Updated 6 months ago
- A pitch detection model trained to be robust against noise and reverberation environments. ☆27 · Jan 21, 2025 · Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆107 · Jan 17, 2025 · Updated last year