zdyshine / beat_track_mgtv_baselineLinks
☆16Updated 4 years ago
Alternatives and similar repositories for beat_track_mgtv_baseline
Users that are interested in beat_track_mgtv_baseline are comparing it to the libraries listed below
Sorting:
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆70Updated 3 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆53Updated last year
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 4 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 4 years ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆26Updated last month
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification☆34Updated 5 years ago
- ☆20Updated 3 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆43Updated 4 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Updated 3 years ago
- Cover Song Detection System☆10Updated 6 years ago
- a compact audio-to-phoneme aligner for singing voice☆12Updated last year
- An Open-Source Project to Unify Audio Processing and Generation☆68Updated last month
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Updated 3 years ago
- An end-to-end chorus detection model DeepChorus.☆37Updated 3 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Updated 3 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 3 years ago
- Official PyTorch implementation of CoverHunter☆30Updated last year
- Simple sinc interpolation in PyTorch.☆14Updated 2 years ago
- RepVgg + HiFiGAN☆35Updated 3 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Updated last year
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 3 years ago
- ☆30Updated 4 years ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆15Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Updated 2 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated last year
- SpeechBrain中文文档☆12Updated 4 years ago