Yuan-ManX / audio-ai-agentLinks
Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
β16Updated last year
Alternatives and similar repositories for audio-ai-agent
Users that are interested in audio-ai-agent are comparing it to the libraries listed below
Sorting:
- SouPyX: An Audio Exploration Space.πͺβ39Updated last year
- "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"β36Updated 3 weeks ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioningβ47Updated last month
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"β34Updated last week
- An official repository of "Music De-limiter Networks via Sample-wise Gain Inversion", which will be presented in WASPAA 2023.β85Updated 10 months ago
- Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)β32Updated 2 years ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understandingβ23Updated 7 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generationβ41Updated last year
- Algorithms to automatically recognize guitar effects and retrieve their parameters for timbre reproductionβ25Updated 3 years ago
- Rearrange a music recording to match a new duration - Code for "Music Rearrangement Using Hierarchical Segmentation", ICASSP 2023β44Updated last year
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compressionβ20Updated last year
- Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.β31Updated 8 months ago
- Audio production style transfer with inference-time optimizationβ44Updated 10 months ago
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 fileβ¦β58Updated last month
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]β25Updated last year
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.β99Updated last month
- β51Updated 10 months ago
- β14Updated last year
- source code of "End-to-end Music Remastering System Using Self-supervised and Adversarial Training"β47Updated 2 years ago
- β51Updated 2 years ago
- Repository for MIDI-GPT, a controllable multi-track music machine.β54Updated last week
- General Purpose Audio Effect Removalβ107Updated 2 years ago
- Multitrack music mixing style transfer given a reference song using differentiable mixing console.β52Updated 2 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.β43Updated 2 years ago
- β13Updated 7 months ago
- Full models and training code for PESTOβ69Updated last year
- Song Describer is a data collection platform for annotating music with textual descriptions.β59Updated 9 months ago
- β48Updated last year
- Official Implementation of "Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music" (ISMIR 2021)β59Updated 2 years ago
- A fast python library for aligning similar audio snippets passed in as NumPy arraysβ48Updated last week