Yuer867 / EMO-Disentanger
This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional Representation"
☆39Updated last month
Related projects ⓘ
Alternatives and complementary repositories for EMO-Disentanger
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆28Updated 3 weeks ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆36Updated last month
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆71Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆62Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆42Updated this week
- Codebase and project page for EDMSound☆29Updated 11 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆53Updated 6 months ago
- ☆61Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated this week
- CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models☆42Updated this week
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey".☆91Updated 2 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆30Updated last month
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆61Updated 4 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆10Updated 4 months ago
- Supervoice diffusion enhance☆25Updated 3 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆139Updated 10 months ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆11Updated this week
- ☆139Updated 3 weeks ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆21Updated last month
- The demo page of UniAudio☆34Updated 9 months ago
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆22Updated 7 months ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 5 months ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆39Updated this week
- Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model☆146Updated 3 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆11Updated 4 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆76Updated 10 months ago
- This repository aims to collect Transformer-based sound event detection (SED) algorithms.☆34Updated last week
- A Graph Deep Learning Library for Music.☆75Updated this week
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆16Updated last month
- GPT-style network for phonemization with durations of text☆62Updated 7 months ago