smulelabs / windowed-roformerLinks
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆37Updated last month
Alternatives and similar repositories for windowed-roformer
Users that are interested in windowed-roformer are comparing it to the libraries listed below
Sorting:
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆43Updated 7 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆53Updated 6 months ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆84Updated last month
- ☆51Updated 6 months ago
- ☆13Updated 9 months ago
- ☆21Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Updated last month
- Landing Page for Divide and Remaster v3☆23Updated 4 months ago
- ☆45Updated last year
- Spherical residual vector quantization (SRVQ)☆31Updated last year
- ☆32Updated last year
- ☆49Updated 8 months ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27Updated 7 months ago
- Prediction of sound event bounding boxes (SEBBs)☆31Updated last year
- Prosody and Pronunciation Modification Network☆60Updated 7 months ago
- ☆112Updated 3 months ago
- ☆45Updated 5 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆36Updated 10 months ago
- Speech Resynthesis and Language Modeling☆27Updated 6 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50Updated 7 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆43Updated 10 months ago
- ☆27Updated last year
- ☆15Updated 8 months ago
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆80Updated 5 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆30Updated 7 months ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆44Updated 3 months ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆73Updated 6 months ago
- Landing Page for All Things Source Separation☆35Updated 3 months ago