yukara-ikemiya / floss-torchLinks
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆62Updated last week
Alternatives and similar repositories for floss-torch
Users that are interested in floss-torch are comparing it to the libraries listed below
Sorting:
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆48Updated 2 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆37Updated 2 months ago
- Source code of APNet2, a vocoder☆55Updated last year
- ☆24Updated 10 months ago
- ☆28Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆27Updated last year
- ☆47Updated 4 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆49Updated 3 months ago
- ☆48Updated last month
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆29Updated last year
- ☆44Updated last year
- ☆13Updated 4 months ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆52Updated last month
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆26Updated 2 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆26Updated 3 months ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆37Updated 3 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆74Updated 6 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆38Updated 5 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- Prosody and Pronunciation Modification Network☆56Updated 2 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- ☆101Updated 11 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆30Updated 5 months ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆84Updated last month
- Spherical residual vector quantization (SRVQ)☆30Updated 11 months ago
- ☆63Updated last year
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated 3 weeks ago
- The source code for the paper XiaoiceSing2 (interspeech2023)☆47Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆29Updated 11 months ago