lucidrains / BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
☆567 Updated 3 weeks ago
Alternatives and similar repositories for BS-RoFormer
Users who are interested in BS-RoFormer are comparing it to the libraries listed below.
- Repository for training models for music source separation. ☆805 Updated 3 weeks ago
- Model for MDX23 music separation contest ☆761 Updated 3 months ago
- ☆219 Updated 5 months ago
- Music repair method to convert lossy MP3 compressed music to lossless music. ☆255 Updated 4 months ago
- ☆270 Updated last year
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021 ☆210 Updated 2 years ago
- Pytorch implementation of the CREPE pitch tracker ☆454 Updated last month
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆948 Updated 11 months ago
- Colab adaptation of MVSep Model for MDX23 music separation contest ☆311 Updated 9 months ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer ☆343 Updated 8 months ago
- An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track) ☆180 Updated 2 years ago
- Official PyTorch implementation of BigVGAN (ICLR 2023) ☆1,060 Updated 10 months ago
- AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI mo… ☆777 Updated this week
- Unofficial PyTorch implementation of Music Source Separation with Band-split RNN ☆178 Updated last year
- All-In-One Music Structure Analyzer ☆580 Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio. ☆1,507 Updated this week
- SOFA: Singing-Oriented Forced Aligner ☆171 Updated last month
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations. ☆231 Updated 3 weeks ago
- This toolbox aims to unify audio generation model evaluation for easier comparison. ☆347 Updated 9 months ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more. ☆282 Updated 3 months ago
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/ ☆386 Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022 ☆294 Updated last year
- The Open Source Code of UniAudio ☆569 Updated 11 months ago
- A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (… ☆438 Updated 2 years ago
- Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (pr… ☆799 Updated 3 weeks ago
- speech self-supervised representations ☆500 Updated 2 years ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23] ☆332 Updated last year
- Self-supervised learning for fast pitch estimation ☆241 Updated 4 months ago
- General Speech Restoration ☆1,186 Updated 4 months ago
- Pitch Estimating Neural Networks (PENN) ☆257 Updated 3 months ago