yukara-ikemiya/floss-torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yukara-ikemiya/floss-torch)

yukara-ikemiya / floss-torch

PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind

☆97

Alternatives and similar repositories for floss-torch

Users that are interested in floss-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 8 months ago
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
yongyizang / TrainingFreeMultiStepASR
View on GitHub
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54May 26, 2025Updated last year
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
NVIDIA / audio-intelligence
View on GitHub
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…
☆137Mar 3, 2026Updated 4 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
smulelabs / smule-renaissance
View on GitHub
Official Repository of Smule Renaissance, Smule's Vocal Restoration Models
☆43Oct 27, 2025Updated 9 months ago
jeonchangbin49 / LimitAug
View on GitHub
☆23Aug 30, 2022Updated 3 years ago
merlresearch / hyper-unmix
View on GitHub
Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…
☆73Apr 27, 2023Updated 3 years ago
AmphionTeam / FlexiCodec
View on GitHub
[ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
☆51Jul 1, 2026Updated 3 weeks ago
SonyCSLParis / Stem-JEPA
View on GitHub
Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation
☆55Aug 6, 2024Updated last year
weAreMusicAI / dmx-diffusion
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
YoonjinXD / kadtk
View on GitHub
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104Jun 12, 2025Updated last year
yongyizang / GSound-SIR
View on GitHub
A Python Room Spatial Impulse Response Ray-Tracing Toolkit
☆86Mar 4, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kwatcharasupat / divide-and-remaster-v3
View on GitHub
Landing Page for Divide and Remaster v3
☆26Jul 29, 2025Updated 11 months ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
junyuchen-cjy / DTTNet-Pytorch
View on GitHub
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
☆109Mar 19, 2024Updated 2 years ago
ex3ndr / supervoice-gpt-facodec
View on GitHub
GPT for FACodec
☆13Mar 25, 2024Updated 2 years ago
crlandsc / torch-l1-snr
View on GitHub
Variations of L1 SNR Loss function for training audio source separation machine learning models
☆45May 1, 2026Updated 2 months ago
SonyCSLParis / codicodec
View on GitHub
Encode and decode audio samples to/from continuous and discrete compressed representations!
☆121Nov 25, 2025Updated 8 months ago
SonyResearch / ITO-Master
View on GitHub
Implementation of the paper "ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors"
☆27Jul 3, 2025Updated last year
koudounasalkis / voc2vec
View on GitHub
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
☆58Apr 14, 2025Updated last year
yongyizang / MSRKit
View on GitHub
Model Implementations, Evaluation Scripts, etc. for Music Source Restoration Challenge 2025.
☆23Nov 14, 2025Updated 8 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
RetroCirce / Choral_Music_Separation
View on GitHub
Chorale Music Separation Dataset and Model Framework
☆41Dec 5, 2022Updated 3 years ago
kwatcharasupat / musdb25
View on GitHub
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
☆13Mar 29, 2025Updated last year
merlresearch / unified-source-separation
View on GitHub
Official repo for task-aware unified source separation (TUSS)
☆23Jul 31, 2025Updated 11 months ago
jjunak-yun / FLowHigh_code
View on GitHub
[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
☆118Jan 17, 2025Updated last year
hhguo / SoCodec
View on GitHub
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92Dec 20, 2024Updated last year
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
yzGuu830 / efficient-speech-codec
View on GitHub
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆126Mar 20, 2025Updated last year
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
yukara-ikemiya / Open-Miipher-2
View on GitHub
PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind
☆70Sep 22, 2025Updated 10 months ago
nomonosound / log-wmse-audio-quality
View on GitHub
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…
☆39Jun 24, 2025Updated last year
facebookresearch / dacvae
View on GitHub
DACVAE
☆226Dec 22, 2025Updated 7 months ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
amanteur / BandSplitRNN-PyTorch
View on GitHub
Unofficial PyTorch implementation of Music Source Separation with Band-split RNN
☆191Jun 10, 2024Updated 2 years ago
BUTSpeechFIT / TS_SUPERB
View on GitHub
☆16Apr 2, 2025Updated last year
haoheliu / torchsubband
View on GitHub
Pytorch implementation of subband decomposition
☆93Jul 26, 2022Updated 4 years ago