INTERSPEECH2023: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music
☆32May 27, 2024Updated last year
Alternatives and similar repositories for MTANet
Users that are interested in MTANet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Jan 16, 2024Updated 2 years ago
- This repository is the offical implementation for the paper 《Frequency-Temporal Attention Network for Singing Melody Extraction》.☆40Sep 16, 2022Updated 3 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆43Sep 14, 2021Updated 4 years ago
- ISMIR2016: Melody extraction on vocal segments using multi-column deep neural networks☆19May 29, 2017Updated 8 years ago
- ☆15Feb 1, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"☆74Feb 10, 2020Updated 6 years ago
- Pytorch implementation of BiFSMN, IJCAI 2022☆22Feb 10, 2023Updated 3 years ago
- ☆16Nov 29, 2024Updated last year
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.☆12Nov 12, 2022Updated 3 years ago
- Pytorch implementation of BiFSMNv2, TNNLS 2023☆35Feb 10, 2023Updated 3 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆11Oct 25, 2022Updated 3 years ago
- singing_melody_extraction☆10Jun 29, 2019Updated 6 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- ☆64Jun 28, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository to storage the 4mula dataset☆10Sep 1, 2021Updated 4 years ago
- This repository contains the trained models and some audio samples for the tPLCnet.☆29Sep 26, 2023Updated 2 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆17May 19, 2023Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025)☆14Nov 20, 2025Updated 4 months ago
- ☆16Sep 19, 2023Updated 2 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆19May 13, 2019Updated 6 years ago
- Technologies for binaurally reproducing ultrasonic and underwater sound sources, such that they are both audible and localisable by a lis…☆21Jan 13, 2026Updated 2 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Jan 15, 2024Updated 2 years ago
- ☆309Jan 25, 2024Updated 2 years ago
- This is the pytorch demo code for Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain, (PTMDA) (IEEE Transactions on Ima…☆11Apr 15, 2022Updated 3 years ago
- Main Melody Extraction with Source-Filter NMF and CRNN☆25Apr 8, 2019Updated 6 years ago
- Implementation of Harmonic Convolution by Harmonic Lowering☆17Nov 11, 2020Updated 5 years ago
- Implementation of The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medic…☆11May 12, 2025Updated 10 months ago
- CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)☆36May 20, 2025Updated 10 months ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆18Jun 21, 2023Updated 2 years ago
- ☆11Mar 20, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 5 years ago
- Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023)☆11Aug 1, 2023Updated 2 years ago
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- "Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks"☆131Dec 27, 2019Updated 6 years ago
- ☆34Feb 14, 2025Updated last year
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆27Feb 19, 2021Updated 5 years ago
- Public repository for the ICLR'23 paper "Few-shot domain adaptation for end-to-end communication"☆11Mar 4, 2023Updated 3 years ago