seungheondoh / speech-to-music
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17 · Updated last year
Alternatives and similar repositories for speech-to-music
Users interested in speech-to-music are comparing it to the repositories listed below:
- Deep Performer: Score-to-audio music performance synthesis ☆42 · Updated last year
- Project for MIDI to Audio Synthesis ☆22 · Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆23 · Updated 8 months ago
- singing voice conversion without f0 ☆23 · Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆23 · Updated 8 months ago
- Landing Page for All Things Source Separation ☆19 · Updated 2 months ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene… ☆20 · Updated last year
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens. ☆12 · Updated 4 months ago
- A unified model for zero-shot singing voice conversion and synthesis ☆21 · Updated 2 years ago
- Repository for Semi-supervised Synthesizer Sound Matching with Differentiable DSP ☆20 · Updated 2 years ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆35 · Updated last year
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste… ☆35 · Updated 4 months ago
- A piano music dataset with Audio, Symbolic and Text labels ☆24 · Updated last month
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆19 · Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆35 · Updated last year
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022) ☆16 · Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆36 · Updated this week
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆36 · Updated last year
- music semantic understanding evaluation benchmark ☆25 · Updated last year
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching ☆17 · Updated last month
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR ☆12 · Updated 11 months ago