Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for audio-ai-agent
Users that are interested in audio-ai-agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Jan 26, 2024Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Basic library for spatial audio SOFA files☆12Sep 29, 2020Updated 5 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Nov 23, 2023Updated 2 years ago
- A DIY head tracker for 3D audio production☆19Mar 20, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Prediction of sound event bounding boxes (SEBBs)☆34Aug 2, 2024Updated last year
- Ableton MIDI-Clip generation using GPT-4☆49Apr 14, 2026Updated 3 weeks ago
- ☆38Jul 4, 2024Updated last year
- SouPyX: An Audio Exploration Space.🪐☆42Nov 28, 2023Updated 2 years ago
- ☆11Mar 1, 2022Updated 4 years ago
- ☆23Feb 2, 2022Updated 4 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- "An optimizer custom node for ComfyUI that ensures each queue execution starts in an optimal state by clearing unused VRAM and unnecessar…☆19Jul 18, 2025Updated 9 months ago
- Using Convolutional Neural Network to Generate Music☆11Nov 4, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A complete open source framework for live computation and broadcasting of audio descriptors, featuring a plugin, a standalone application…☆42Mar 11, 2026Updated last month
- A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming☆62Jun 30, 2025Updated 10 months ago
- Event Relation in Text-to-Audio (TTA) Generation☆21Feb 26, 2025Updated last year
- A polyphonic music transcription Vamp plugin☆10Nov 20, 2019Updated 6 years ago
- Pythonic interface to the EMC Unity REST API☆10Mar 9, 2022Updated 4 years ago
- ☆11Apr 30, 2022Updated 4 years ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆112Dec 20, 2025Updated 4 months ago
- mp3 as VST-effect☆59Oct 20, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- beat swapping powered by AI☆14Jul 7, 2024Updated last year
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)