[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆26Aug 30, 2024Updated last year
Alternatives and similar repositories for SVT_SpeechBrain
Users that are interested in SVT_SpeechBrain are comparing it to the libraries listed below
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- High-Resolution Violin Transcription using Weak Labels☆36Oct 29, 2023Updated 2 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- ☆12Feb 3, 2026Updated last month
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…☆11Jul 6, 2023Updated 2 years ago
- ☆11Dec 17, 2025Updated 2 months ago
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- A curated list of resources in audio-visual question answering and related areas. :-)☆17Jun 29, 2025Updated 8 months ago
- ☆17Apr 8, 2024Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆15Feb 17, 2023Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆15Jun 28, 2024Updated last year
- ☆32Nov 25, 2023Updated 2 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆33Apr 22, 2024Updated last year
- This is the code repository for the paper "Emotion-Guided Music Accompaniment Generation based on VAE".☆13Oct 11, 2023Updated 2 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- ☆17Jan 20, 2025Updated last year
- Automatic DJ-mixing of tracks☆35Feb 11, 2020Updated 6 years ago
- Machine learning tools and framework for automatic music transcription.☆36Jun 17, 2024Updated last year
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"☆18Dec 14, 2023Updated 2 years ago
- Code accompanying ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆19Jul 19, 2024Updated last year
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- ☆19Jan 30, 2023Updated 3 years ago
- "Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022 (Data Generation Pipeline)☆40Sep 5, 2022Updated 3 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- ☆18Nov 8, 2024Updated last year
- code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD)☆18Jan 29, 2024Updated 2 years ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆17May 3, 2023Updated 2 years ago
- Inference service based on DINet, for inference on video streams and videos☆17Nov 8, 2023Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆38Feb 1, 2024Updated 2 years ago
- ☆25Jun 19, 2025Updated 8 months ago
- This repository is for an implementation of the accepted paper "Sketching the Expression: Flexible Rendering of Expressive Piano Performa…☆22Dec 15, 2022Updated 3 years ago
- Rearrange a music recording to match a new duration - Code for "Music Rearrangement Using Hierarchical Segmentation", ICASSP 2023☆45Mar 30, 2024Updated last year
- Code and demo for paper: Zhao et al., "Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music re-Arrangement," IJCAI 202…☆20May 2, 2024Updated last year
- a new stem dataset for Music Demixing research, from the OnAir royalty-free music project☆37Mar 14, 2023Updated 2 years ago