zdyshine / beat_track_mgtv_baseline
☆16Updated 3 years ago
Alternatives and similar repositories for beat_track_mgtv_baseline:
Users that are interested in beat_track_mgtv_baseline are comparing it to the libraries listed below
- Cover Song Detection System☆10Updated 6 years ago
- An end-to-end chorus detection model DeepChorus.☆36Updated 3 years ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆19Updated last week
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- ☆18Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- ☆12Updated 2 months ago
- ☆27Updated 4 years ago
- Official PyTorch implementation of CoverHunter☆29Updated 4 months ago
- ☆11Updated 2 years ago
- Official source codes of coco-mulla☆34Updated last year
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆52Updated last year
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "☆39Updated 4 years ago
- a compact audio-to-phoneme aligner for singing voice☆10Updated last year
- Semi-supervised learning using teacher-student models for vocal melody extraction☆42Updated 3 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆21Updated last year
- Beat and downbeat tracking on symbolic music data☆34Updated 2 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 3 years ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆67Updated last year
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 8 months ago
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆71Updated 4 years ago
- Solos: A Dataset for Audio-Visual Music Analysis☆21Updated 2 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆57Updated 5 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆36Updated 2 months ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆23Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆26Updated 11 months ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆53Updated 4 years ago