Audio-AGI / FlowSep
This is the code implementation for FlowSep
☆10Updated last week
Related projects ⓘ
Alternatives and complementary repositories for FlowSep
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆66Updated last week
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 9 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆71Updated 2 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆132Updated 3 months ago
- ☆40Updated 5 months ago
- Code for Investigating Personalization Methods in Text to Music Generation☆35Updated 7 months ago
- Unofficial download repository for MusicCaps☆44Updated last year
- Codebase and project page for EDMSound☆29Updated last year
- ☆61Updated 7 months ago
- Audiogen Codec☆127Updated 4 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆58Updated this week
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆98Updated 3 weeks ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆150Updated 2 months ago
- ☆34Updated 5 months ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆19Updated 2 weeks ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆31Updated 2 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆10Updated 5 months ago
- Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆104Updated last month
- The open source code for SimpleSpeech series☆111Updated last month
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆48Updated last month
- An unofficial PyTorch implementation of VALL-E☆77Updated this week
- Zero-Shot Emotion Style Transfer☆37Updated 7 months ago
- ☆81Updated 2 months ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆29Updated 10 months ago
- GPT-style network for phonemization with durations of text☆62Updated 8 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆50Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆88Updated 3 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆43Updated last month