zdyshine / beat_track_mgtv_baseline
☆16 · Updated 3 years ago
Alternatives and similar repositories for beat_track_mgtv_baseline:
Users who are interested in beat_track_mgtv_baseline are comparing it to the libraries listed below:
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach ☆20 · Updated 3 years ago
- Official source codes of coco-mulla ☆32 · Updated 11 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆21 · Updated last year
- Cover Song Detection System ☆10 · Updated 5 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning" ☆14 · Updated last year
- ☆18 · Updated 3 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021) ☆64 · Updated 3 years ago
- ☆18 · Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 · Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Updated last year
- Supplementary material for the ISMIR 2020 paper: “Deconstruct, Analyse, Reconstruct: how to improve tempo, beat, and downbeat estimation”… ☆11 · Updated 4 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos" ☆39 · Updated 4 years ago
- ICASSP 2021 accepted papers in terms of voice conversion (VC) ☆18 · Updated 3 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022) ☆52 · Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding". ☆24 · Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆66 · Updated 2 years ago
- ☆38 · Updated 2 years ago
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020) ☆70 · Updated 4 years ago
- Solos: A Dataset for Audio-Visual Music Analysis ☆21 · Updated 2 years ago
- The official PyTorch implementation of paper: An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmen… ☆9 · Updated 3 years ago
- The source code for the paper CrossSinger (ASRU 2023) ☆18 · Updated last year
- Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024) ☆48 · Updated 3 weeks ago
- 60k hours of phoneme-aligned audio from audio books ☆18 · Updated 7 months ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem". ☆12 · Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆25 · Updated 10 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Updated last year
- ☆25 · Updated 2 years ago
- ☆11 · Updated 2 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine ☆35 · Updated last month