seungheondoh/speech-to-music

Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]

☆17

Alternatives and similar repositories for speech-to-music

Users that are interested in speech-to-music are comparing it to the libraries listed below.

andrebola / contrastive-mir-learning
This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"
☆15 · Jun 22, 2023 · Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 · Nov 30, 2022 · Updated 3 years ago
CODEJIN / VITS_Diffusion
☆26 · Sep 22, 2022 · Updated 3 years ago
bbanar2 / Exploring_XAI_in_GenMus_via_LSR
☆14 · Sep 19, 2021 · Updated 4 years ago
sadiela / ml-for-audio
☆18 · Feb 11, 2025 · Updated last year
zhaojw1998 / DAT-CVAE
Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…
☆21 · Mar 28, 2023 · Updated 3 years ago
jeffreyjohnens / style_rank
☆15 · Feb 19, 2020 · Updated 6 years ago
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348 · Apr 8, 2024 · Updated 2 years ago
jhtonyKoo / music_mixing_style_transfer
☆181 · Oct 24, 2023 · Updated 2 years ago
Sound2Synth / Sound2Synth-Plug-Ins
Sound2Synth Plug-Ins
☆14 · Jul 28, 2022 · Updated 4 years ago
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18 · Oct 12, 2023 · Updated 2 years ago
davidmoris688 / finlab
☆12 · Mar 25, 2023 · Updated 3 years ago
jongpillee / music_dataset_split
☆37 · Jun 20, 2017 · Updated 9 years ago
SoMA-group / style-drumsynth
Style-based Neural Drum Synthesis with GAN inversion
☆33 · Nov 9, 2021 · Updated 4 years ago
astradzhao / music-rfm
Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…
☆40 · Oct 26, 2025 · Updated 9 months ago
Yuer867 / EMO_Harmonizer
This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.
☆12 · Sep 25, 2024 · Updated last year
minzwon / tag-based-music-retrieval
☆58 · Nov 2, 2020 · Updated 5 years ago
shinhyeokoh / rwen
☆14 · Jun 16, 2023 · Updated 3 years ago
legoodmanner / jukedrummer
☆39 · Mar 10, 2023 · Updated 3 years ago
ZackHodari / discrete_intonation
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…
☆17 · May 24, 2020 · Updated 6 years ago
wayne391 / sf_segmenter
Music segmentation algorithm, based on SF (structural feature)
☆57 · Feb 8, 2023 · Updated 3 years ago
YiWeiWayne / Cross-dataset-mood-prediction
The source code of "Cross-Cultural Music Emotion Recognition by Adversarial Discriminative Domain Adaptation"
☆11 · Nov 19, 2018 · Updated 7 years ago
jasonppy / FaST-VGS-Family
Transformer-based visually grounded speech models
☆19 · Sep 22, 2022 · Updated 3 years ago
pc2752 / ss_synthesis
☆17 · Jul 31, 2019 · Updated 7 years ago
nii-yamagishilab / SSL-SAS
Language independent SSL-based Speaker Anonymization system
☆20 · May 28, 2024 · Updated 2 years ago
MiniXC / LightningFastSpeech2
☆55 · Jan 13, 2023 · Updated 3 years ago
PapayaResearch / ctag
[ICML'24] Creative Text-to-Audio Generation via Synthesizer Programming
☆41 · Sep 26, 2024 · Updated last year
suhitaghosh10 / emo-stargan
Implementation of Emo-StarGAN
☆48 · Dec 19, 2023 · Updated 2 years ago
ElunaMamka / NG-Midiformer
Official code of "N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding"
☆14 · Apr 10, 2024 · Updated 2 years ago
NEXTLab-ZJU / MelodyGLM
☆13 · Sep 1, 2023 · Updated 2 years ago
bobcolner / pandas-polygon
☆15 · Feb 7, 2021 · Updated 5 years ago
fearofchou / mmnet
☆16 · Apr 10, 2019 · Updated 7 years ago
TomohikoNakamura / dwtls
Discrete wavelet transform layers with fixed and trainable wavelets
☆22 · Nov 27, 2022 · Updated 3 years ago
serkansulun / midi-emotion
Generates multi-instrument symbolic music (MIDI), based on user-provided emotions from valence-arousal plane.
☆65 · Mar 5, 2025 · Updated last year
lingyu123-su / Amadeus
To make music production easier, we introduce Amadeus, a novel MIDI generation framework. While significantly improving generation quali…
☆16 · Aug 29, 2025 · Updated 11 months ago
keshavbhandari / yinyang
☆20 · May 7, 2025 · Updated last year
ilaria-manco / muscaps
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
☆86 · Dec 3, 2024 · Updated last year
yuhanghe01 / RiTTA
Event Relation in Text-to-Audio (TTA) Generation
☆21 · Feb 26, 2025 · Updated last year
Lonian6 / SSM-TTM
Training-Efficient Text-to-Music Generation with State-Space Modeling
☆16 · Jan 31, 2026 · Updated 6 months ago