Yuan-ManX / audio-ai-agent
Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆11Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for audio-ai-agent
- SouPyX: An Audio Exploration Space.🪐☆31Updated 11 months ago
- ☆33Updated 6 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated last week
- BEGANSing - Korean SVS + SVC + AudioSR☆12Updated 9 months ago
- Code for Investigating Personalization Methods in Text to Music Generation☆35Updated 7 months ago
- ☆10Updated 9 months ago
- Project for MIDI to Audio Synthesis☆22Updated last year
- Algorithms to automatically recognize guitar effects and retrieve their parameters for timbre reproduction☆22Updated 2 years ago
- singing voice conversion without f0☆22Updated last year
- Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.☆27Updated 11 months ago
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression☆18Updated last year
- Official source codes of airsep☆34Updated 7 months ago
- music semantic understanding evaluation benchmark☆25Updated last year
- Audio production style transfer with inference-time optimization☆26Updated this week
- A list of datasets made available by members of the Aalto Acoustics Lab☆19Updated 2 months ago
- Rearrange a music recording to match a new duration - Code for "Music Rearrangement Using Hierarchical Segmentation", ICASSP 2023☆42Updated 7 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆42Updated 3 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆21Updated 3 months ago
- SPAUQ: Spatial Audio Quality Evaluation☆13Updated 6 months ago
- ☆21Updated 7 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- Landing Page for All Things Source Separation☆17Updated 2 weeks ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆10Updated 5 months ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated 6 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 3 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆37Updated last year
- Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems☆37Updated this week
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆20Updated 11 months ago