zdyshine / beat_track_mgtv_baseline
☆16Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for beat_track_mgtv_baseline
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- ☆11Updated 2 years ago
- Cover Song Detection System☆10Updated 5 years ago
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 3 years ago
- ☆25Updated last year
- Efficient synchronization from sparse cues☆28Updated 6 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Collection of works from VIPL-AVSU☆40Updated 3 months ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆22Updated last year
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Updated 2 years ago
- ☆19Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆32Updated this week
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 3 years ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 3 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆19Updated 10 months ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆19Updated 3 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Updated 10 months ago
- Official PyTorch implementation of CoverHunter☆24Updated 7 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- a compact audio-to-phoneme aligner for singing voice☆10Updated 10 months ago
- A purely header only c version of hifi-gan☆8Updated 3 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆53Updated 2 years ago
- ☆14Updated last year