lucidrains / BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
☆542 Updated 4 months ago
Alternatives and similar repositories for BS-RoFormer
Users who are interested in BS-RoFormer are comparing it to the libraries listed below.
- Repository for training models for music source separation. ☆756 Updated 3 weeks ago
- ☆202 Updated 4 months ago
- Model for MDX23 music separation contest ☆736 Updated last month
- ☆257 Updated last year
- Music repair method to convert lossy MP3 compressed music to lossless music. ☆241 Updated 2 months ago
- Pytorch implementation of the CREPE pitch tracker ☆445 Updated 2 weeks ago
- Colab adaptation of MVSep Model for MDX23 music separation contest ☆309 Updated 8 months ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021 ☆205 Updated 2 years ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer ☆339 Updated 6 months ago
- BandIt: Cinematic Audio Source Separation ☆120 Updated 10 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆929 Updated 9 months ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al. ☆214 Updated last year
- Unofficial PyTorch implementation of Music Source Separation with Band-split RNN ☆176 Updated 11 months ago
- General Speech Restoration ☆1,154 Updated 3 months ago
- SOFA: Singing-Oriented Forced Aligner ☆168 Updated 2 weeks ago
- Official PyTorch implementation of BigVGAN (ICLR 2023) ☆1,030 Updated 8 months ago
- An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track) ☆176 Updated last year
- A collection of neural vocoders suitable for singing voice synthesis tasks. ☆124 Updated 2 months ago
- Preprocess Audio for training ☆340 Updated 2 months ago
- AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI mo… ☆745 Updated 3 months ago
- model_repo ☆121 Updated 2 years ago
- Self-supervised learning for fast pitch estimation ☆232 Updated 3 months ago
- The Open Source Code of UniAudio ☆562 Updated 10 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022 ☆290 Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio. ☆1,450 Updated this week
- ☆143 Updated 3 months ago
- HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ☆361 Updated 7 months ago
- General Speech Restoration ☆278 Updated last year
- Pitch Estimating Neural Networks (PENN) ☆253 Updated last month
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3 ☆410 Updated 8 months ago