lsfhuihuiff / MusicTI_AAAI2024View external linksLinks
" Music Style Transfer with Time-Varying Inversion of Diffusion Models"
☆60Jul 23, 2024Updated last year
Alternatives and similar repositories for MusicTI_AAAI2024
Users that are interested in MusicTI_AAAI2024 are comparing it to the libraries listed below
Sorting:
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- ONSETS&VELOCITIES real-time piano detection - PyTorch training [EUSIPCO2023]☆28Aug 31, 2023Updated 2 years ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆98Feb 24, 2025Updated 11 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 8 months ago
- Official source codes of airsep☆39Mar 26, 2024Updated last year
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Mar 5, 2025Updated 11 months ago
- Work with symbolic music gen AI easily, based on midi manipulation.☆27Jan 10, 2026Updated last month
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated last year
- ☆180Oct 24, 2023Updated 2 years ago
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- ☆11Nov 7, 2024Updated last year
- MIDI-LLM Official Transformers implementation (NeurIPS AI4Music '25)☆73Nov 10, 2025Updated 3 months ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Dec 5, 2021Updated 4 years ago
- ☆55Nov 5, 2024Updated last year
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆146Mar 23, 2024Updated last year
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆31Oct 26, 2025Updated 3 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Apr 8, 2020Updated 5 years ago
- ☆16Sep 19, 2023Updated 2 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆13Mar 14, 2024Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- ☆82Jan 22, 2025Updated last year
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆82Oct 19, 2023Updated 2 years ago
- Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).☆86Aug 12, 2024Updated last year
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Jun 28, 2024Updated last year
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Mar 3, 2022Updated 3 years ago
- Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).☆11Apr 14, 2020Updated 5 years ago
- Online spectrogram inversion for audio source separation☆11Oct 11, 2025Updated 4 months ago
- This repository contains code for an acoustic simulation framework that can be used for acoustic/ultrasonic indoor positioning and/or dat…☆13May 7, 2024Updated last year
- A list of all repositories from Music X Lab☆33Dec 8, 2025Updated 2 months ago