zdyshine / beat_track_mgtv_baseline
☆16 Updated 3 years ago
Alternatives and similar repositories for beat_track_mgtv_baseline:
Users who are interested in beat_track_mgtv_baseline are comparing it to the libraries listed below.
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach ☆20 Updated 3 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at BMVC 2022) ☆52 Updated last year
- 60k hours of phoneme-aligned audio from audio books ☆18 Updated 6 months ago
- ☆18 Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆65 Updated 2 years ago
- Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024) ☆41 Updated 9 months ago
- Official PyTorch implementation of CoverHunter ☆27 Updated 2 months ago
- ICASSP 2021 accepted papers in terms of voice conversion (VC) ☆18 Updated 3 years ago
- ☆18 Updated 3 years ago
- ☆11 Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 Updated last year
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos" ☆39 Updated 4 years ago
- audiomod is a project for audio modifications, including audio manipulators such as time-stretching, pitch-shifting, formant-changing, and… ☆3 Updated 6 months ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder. ☆19 Updated 3 years ago
- ☆27 Updated 3 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding". ☆24 Updated last year
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine ☆35 Updated 2 weeks ago
- a compact audio-to-phoneme aligner for singing voice ☆10 Updated last year
- A dataset for Audio-Visual Sound Event Detection in Movies ☆26 Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 Updated 2 years ago
- Cover Song Detection System ☆10 Updated 5 years ago
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment" ☆12 Updated last year
- ☆11 Updated 3 years ago
- ☆37 Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆19 Updated last year
- Synthesized singing voice demos of WeSinger 2 paper.