guxm2021 / SVT_SpeechBrain
[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆21 · Updated 6 months ago
Alternatives and similar repositories for SVT_SpeechBrain:
Users who are interested in SVT_SpeechBrain are comparing it to the libraries listed below.
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆36 · Updated this week
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction" (ISMIR 2023) ☆21 · Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆35 · Updated last year
- Project for MIDI to Audio Synthesis ☆22 · Updated 2 years ago
- A piano music dataset with Audio, Symbolic and Text labels ☆25 · Updated last week
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆23 · Updated 10 months ago
- Deep Performer: Score-to-audio music performance synthesis ☆43 · Updated last year
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER… ☆28 · Updated 2 years ago
- PyTorch implementation of the invertible CQT based on Non-stationary Gabor filters ☆29 · Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆25 · Updated 10 months ago
- Polyphonic generalisation of DDSP ☆18 · Updated 10 months ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper. ☆36 · Updated 8 months ago
- Accompanying repository for the paper "Automatic Estimation of Singing Voice Musical Dynamics" ☆13 · Updated 4 months ago
- Codebase for the ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval" ☆34 · Updated last year
- Official Implementation of Jointist ☆33 · Updated last year
- Official source code of airsep ☆36 · Updated 11 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆40 · Updated last month
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 · Updated 2 years ago
- Code for the ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste… ☆36 · Updated 6 months ago
- Landing Page for All Things Source Separation ☆22 · Updated 4 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Updated last year
- ☆17 · Updated 3 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆18 · Updated last month
- ☆16 · Updated 4 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆46 · Updated 6 months ago
- TheGlueNote is a representation model for note-wise music alignment. ☆11 · Updated 7 months ago