lucidrains / BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
☆429Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for BS-RoFormer
- Repository for training models for music source separation.☆481Updated last week
- ☆222Updated 9 months ago
- Model for MDX23 music separation contest☆639Updated 4 months ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆322Updated 2 weeks ago
- Colab adaptation of MVSep Model for MDX23 music separation contest☆271Updated last month
- ☆76Updated 2 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆827Updated 3 months ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021☆190Updated last year
- Unofficial PyTorch implementation of Music Source Separation with Band-split RNN☆155Updated 5 months ago
- Pytorch implementation of the CREPE pitch tracker☆408Updated 5 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022☆279Updated last year
- unofficial vits2-TTS implementation in pytorch☆488Updated 7 months ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆206Updated last year
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆227Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆210Updated 5 months ago
- Music repair method to convert lossy MP3 compressed music to lossless music.☆136Updated 2 weeks ago
- The Open Source Code of UniAudio☆522Updated 3 months ago
- Self-supervised learning for fast pitch estimation☆191Updated last month
- SOFA: Singing-Oriented Forced Aligner☆137Updated last week
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆284Updated 7 months ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆232Updated 8 months ago
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆895Updated 2 months ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆239Updated 3 weeks ago
- Singing Voice Synthesis based on VITS, different from VISinger☆187Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆275Updated last year
- speech self-supervised representations☆467Updated last year
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆204Updated 4 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆227Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.☆1,211Updated 4 months ago
- BandIt: Cinematic Audio Source Separation☆94Updated 4 months ago