guxm2021 / MM_ALT
[MM 2022 Oral] MM-ALT: A Multimodal Automatic Lyric Transcription System
☆16Updated 6 months ago
Related projects: ⓘ
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆44Updated 4 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆62Updated 2 months ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆28Updated 3 months ago
- ☆93Updated 3 weeks ago
- Music generation☆24Updated 4 months ago
- ☆21Updated 3 weeks ago
- ARCH: Audio Representations benCHmark☆25Updated 3 weeks ago
- ☆50Updated last year
- ☆35Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆23Updated 6 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆42Updated 2 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆29Updated 8 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆47Updated last month
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- Music Audio Representation Benchmark for Universal Evaluation☆84Updated 4 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆73Updated 6 months ago
- Unofficial download repository for MusicCaps☆41Updated last year
- ☆41Updated 2 months ago
- " Music Style Transfer with Time-Varying Inversion of Diffusion Models"☆31Updated last month
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆54Updated last year
- ☆10Updated last week
- Source code for the paper 'Audio Captioning Transformer'☆47Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated 11 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆18Updated 3 months ago
- This package aims at simplifying the download of the AudioCaps dataset.☆29Updated 9 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆30Updated 7 months ago
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)☆26Updated 6 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆52Updated 3 weeks ago
- ☆25Updated 5 months ago
- Chorale Music Separation Dataset and Model Framework☆31Updated last year