Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"
☆157May 12, 2022Updated 4 years ago
Alternatives and similar repositories for icassp2022-vocal-transcription
Users that are interested in icassp2022-vocal-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VOCANO: A note transcription framework for singing voice in polyphonic music☆72Aug 9, 2021Updated 4 years ago
- ☆107Aug 23, 2024Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 3 years ago
- This repository is the offical implementation for the paper 《Frequency-Temporal Attention Network for Singing Melody Extraction》.☆40Sep 16, 2022Updated 3 years ago
- Singing Voice Synthesis based on VITS, different from VISinger☆197Nov 13, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Sep 26, 2023Updated 2 years ago
- Transcribe music into lead sheets!☆436May 14, 2025Updated last year
- ☆14Feb 3, 2026Updated 4 months ago
- ☆111Jun 11, 2021Updated 4 years ago
- The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music"☆42Oct 25, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆275Aug 28, 2022Updated 3 years ago
- SOME: Singing-Oriented MIDI Extractor.☆693Mar 7, 2026Updated 3 months ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆68Mar 5, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆182Apr 28, 2023Updated 3 years ago
- "Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks"☆131Dec 27, 2019Updated 6 years ago
- The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"☆74Feb 10, 2020Updated 6 years ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆119Jan 26, 2025Updated last year
- ☆13Sep 1, 2023Updated 2 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Sep 9, 2023Updated 2 years ago
- An end-to-end chorus detection model DeepChorus.☆37Apr 28, 2026Updated last month
- chorus detection for pop music☆47Feb 2, 2023Updated 3 years ago
- ☆228Dec 29, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆322Jan 25, 2024Updated 2 years ago
- pytorch implementation of JDCNet, singing voice detection and classification network☆54Feb 15, 2023Updated 3 years ago
- Robust Singing Voice Transcription and MIDI Extraction☆121Nov 20, 2024Updated last year
- Singing voice detection☆15Aug 28, 2018Updated 7 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆14Mar 11, 2025Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆47Jan 23, 2025Updated last year
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆17Jan 29, 2022Updated 4 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆42Sep 14, 2021Updated 4 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆198Sep 15, 2022Updated 3 years ago
- ☆18Jun 24, 2025Updated 11 months ago
- ☆54Feb 22, 2022Updated 4 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 3 years ago
- The implementation of "Symbolic Music Loop Generation with Neural Discrete Representations"☆34Aug 24, 2022Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated 2 years ago
- SOFA: Singing-Oriented Forced Aligner☆222May 16, 2025Updated last year