seungheondoh / speech-to-music
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17 · Aug 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for speech-to-music
Users who are interested in speech-to-music are comparing it to the libraries listed below.
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene… ☆21 · Mar 28, 2023 · Updated 2 years ago
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation. ☆12 · Sep 25, 2024 · Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 · Nov 30, 2022 · Updated 3 years ago
- Improving Symbolic Music Generation with Inference-Time Alignment ☆20 · Aug 2, 2025 · Updated 6 months ago
- Sound2Synth Plug-Ins ☆13 · Jul 28, 2022 · Updated 3 years ago
- ☆22 · Jul 30, 2025 · Updated 6 months ago
- ☆14 · Jun 16, 2023 · Updated 2 years ago
- Official implementation of WildFX Dataset Generating pipeline. ☆15 · Oct 21, 2025 · Updated 3 months ago
- The source code of "Cross-Cultural Music Emotion Recognition by Adversarial Discriminative Domain Adaptation" ☆11 · Nov 19, 2018 · Updated 7 years ago
- ☆26 · Sep 22, 2022 · Updated 3 years ago
- ☆13 · Sep 1, 2023 · Updated 2 years ago
- Official code of "N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding" ☆14 · Apr 10, 2024 · Updated last year
- ☆14 · Sep 19, 2021 · Updated 4 years ago
- ☆15 · Feb 19, 2020 · Updated 5 years ago
- ☆25 · Apr 18, 2025 · Updated 10 months ago
- ☆22 · Jan 29, 2026 · Updated 2 weeks ago
- ☆13 · Mar 7, 2024 · Updated last year
- ☆18 · Feb 11, 2025 · Updated last year
- audiolm-pytorch training code ☆15 · Jul 31, 2023 · Updated 2 years ago
- Generates multi-instrument symbolic music (MIDI), based on user-provided emotions from valence-arousal plane. ☆65 · Mar 5, 2025 · Updated 11 months ago
- ☆28 · Jul 7, 2025 · Updated 7 months ago
- The source code for the paper CrossSinger (ASRU 2023) ☆18 · Oct 12, 2023 · Updated 2 years ago
- ☆38 · Mar 10, 2023 · Updated 2 years ago
- HairNet: Hairstyle Transfer with Pose Changes ☆18 · Jul 20, 2022 · Updated 3 years ago
- ☆20 · May 7, 2025 · Updated 9 months ago
- Language independent SSL-based Speaker Anonymization system ☆19 · May 28, 2024 · Updated last year
- ☆37 · Jun 20, 2017 · Updated 8 years ago
- Implementation of Emo-StarGAN ☆46 · Dec 19, 2023 · Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning" ☆15 · Jun 22, 2023 · Updated 2 years ago
- ☆18 · Jul 31, 2019 · Updated 6 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆17 · May 24, 2020 · Updated 5 years ago
- ☆16 · Apr 10, 2019 · Updated 6 years ago
- A repo that builds text to music datasets from scratch, used in MuseControlLite [ICML2025] ☆27 · May 20, 2025 · Updated 8 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 · May 27, 2023 · Updated 2 years ago
- Transformer-based visually grounded speech models ☆19 · Sep 22, 2022 · Updated 3 years ago
- ☆18 · May 4, 2025 · Updated 9 months ago
- DNN based singing voice synthesis ☆17 · Oct 15, 2018 · Updated 7 years ago
- Addressing the confounds of accompaniments in singer identification ☆18 · Mar 24, 2020 · Updated 5 years ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23] ☆343 · Apr 8, 2024 · Updated last year