laurenceyoon / real-time-lyrics-alignment
Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024
☆12Updated last month
Related projects ⓘ
Alternatives and complementary repositories for real-time-lyrics-alignment
- Unofficial implementation of NANSY++ in Pytorch Lightning☆49Updated 8 months ago
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆32Updated 2 months ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆26Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 7 months ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals☆28Updated last year
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆26Updated 5 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated last month
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆54Updated last year
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆34Updated last month
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆13Updated 2 years ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆38Updated 2 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆37Updated last year
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513☆63Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆20Updated 11 months ago
- ☆12Updated 4 years ago
- ☆34Updated 5 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆18Updated last week
- ☆21Updated 2 years ago
- Prosody and Pronunciation Modification Network☆44Updated 3 months ago
- ☆40Updated 5 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆31Updated 10 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆25Updated 3 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- ☆21Updated 7 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- ☆49Updated last year
- ☆12Updated last year
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri…☆27Updated last week
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 9 months ago
- million song dataset split for extended clean tag & artist-level stratified☆47Updated last year