yukara-ikemiya / floss-torch
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆91 · Nov 24, 2025 · Updated 2 months ago
Alternatives and similar repositories for floss-torch
Users who are interested in floss-torch are comparing it to the libraries listed below.
- Unofficial implementation of ConvNeXt-TTS powered by Lightning · ☆18 · Oct 20, 2024 · Updated last year
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer" · ☆42 · Oct 30, 2025 · Updated 3 months ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025. · ☆47 · Apr 14, 2025 · Updated 10 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… · ☆69 · Apr 27, 2023 · Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … · ☆13 · Dec 4, 2024 · Updated last year
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation · ☆13 · Mar 29, 2025 · Updated 10 months ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation" · ☆54 · May 26, 2025 · Updated 8 months ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform · ☆18 · May 17, 2024 · Updated last year
- ☆15 · Nov 10, 2025 · Updated 3 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" · ☆107 · Jan 17, 2025 · Updated last year
- ☆32 · Jul 27, 2022 · Updated 3 years ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation". · ☆23 · Sep 27, 2025 · Updated 4 months ago
- ☆19 · Mar 22, 2024 · Updated last year
- Full models and training code for PESTO · ☆75 · Jun 12, 2024 · Updated last year
- My vocoder experiments · ☆31 · Jul 26, 2025 · Updated 6 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … · ☆32 · Apr 22, 2024 · Updated last year
- ☆15 · Apr 2, 2025 · Updated 10 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding · ☆51 · May 1, 2025 · Updated 9 months ago
- The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays · ☆25 · Dec 26, 2025 · Updated last month
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers · ☆125 · Mar 20, 2025 · Updated 10 months ago
- Compute distribution-based quality metrics for audio data using embeddings, with a focus on music. · ☆43 · Jan 15, 2026 · Updated last month
- Code for the paper "Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription" · ☆40 · May 5, 2024 · Updated last year
- "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures" · ☆47 · Aug 23, 2025 · Updated 5 months ago
- ☆28 · Nov 15, 2023 · Updated 2 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications · ☆86 · Dec 20, 2024 · Updated last year
- Code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD) · ☆18 · Jan 29, 2024 · Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023. · ☆45 · May 18, 2023 · Updated 2 years ago
- Unofficial implementation of NANSY++ in PyTorch Lightning · ☆50 · Mar 11, 2024 · Updated last year
- ☆62 · Nov 6, 2023 · Updated 2 years ago
- Source code of EfficientTTS 2 · ☆20 · Feb 18, 2024 · Updated last year
- BandIt: Cinematic Audio Source Separation · ☆154 · Jul 29, 2025 · Updated 6 months ago
- Readability-aware automatic lyrics transcription (ALT) evaluation toolkit · ☆43 · Aug 29, 2024 · Updated last year
- 2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe… · ☆62 · Sep 1, 2024 · Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… · ☆77 · Jul 16, 2023 · Updated 2 years ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025 · ☆37 · Feb 24, 2025 · Updated 11 months ago
- Speaker-disentangled speech linguistic content quantizer · ☆24 · Mar 19, 2025 · Updated 10 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks · ☆22 · Oct 10, 2025 · Updated 4 months ago
- A standardized toolkit of Kernel Audio Distance (KAD), a distribution-free, unbiased, and computationally efficient metric for evaluating … · ☆95 · Jun 12, 2025 · Updated 8 months ago
- A lightweight audio codec based on a single quantizer · ☆69 · Aug 15, 2025 · Updated 6 months ago