Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"
☆18Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for LyricAlignment
Users that are interested in LyricAlignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆54Nov 20, 2023Updated 2 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆13Mar 11, 2025Updated last year
- singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/☆20Dec 30, 2019Updated 6 years ago
- ☆17Jun 24, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- ☆12Nov 7, 2024Updated last year
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆27Aug 30, 2024Updated last year
- ☆22Jun 13, 2024Updated last year
- Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation"☆25Feb 3, 2026Updated 2 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- ☆18Mar 27, 2023Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal Tasks☆44Feb 11, 2024Updated 2 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆27Apr 23, 2024Updated last year
- version 4.x of the Princeton Geniza Project☆12Updated this week
- Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆19Jul 19, 2024Updated last year
- GenerationMania: Generate IIDX-style rhythm action game stages☆12Aug 3, 2019Updated 6 years ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Mar 4, 2025Updated last year
- dataset, environment, and other resources for mrCAD paper☆22Sep 19, 2025Updated 6 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Feb 20, 2025Updated last year
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆23Apr 23, 2024Updated last year
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- Robust Singing Voice Transcription and MIDI Extraction☆117Nov 20, 2024Updated last year
- ☆14Jan 28, 2023Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆274Aug 28, 2022Updated 3 years ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆24Nov 8, 2021Updated 4 years ago
- 東北イタコ歌唱データベースの最新ラベルデータ☆23Jul 1, 2021Updated 4 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Nov 2, 2020Updated 5 years ago
- ☆18May 4, 2025Updated 11 months ago
- Pytorch implementation of BigVSAN☆202Dec 9, 2025Updated 4 months ago
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆77Nov 11, 2025Updated 5 months ago
- Deep Learning & Applied AI: Tutorials☆14Jul 5, 2020Updated 5 years ago
- [ICCV 2023] Official PyTorch implementation of "A Multidimensional Analysis of Social Biases in Vision Transformers"☆13Aug 11, 2023Updated 2 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Sep 9, 2023Updated 2 years ago