Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for audio-ai-agent
Users that are interested in audio-ai-agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Jan 26, 2024Updated 2 years ago
- A tool to easy reimport waves in Wwise project under P4V☆11Dec 18, 2024Updated last year
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Basic library for spatial audio SOFA files☆12Sep 29, 2020Updated 5 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Nov 23, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 10 months ago
- Various plugins created for Wwise☆25Jul 15, 2019Updated 6 years ago
- jsfxr (ported from sfxr) with added Wwise connectivity, embedded into Electron☆12Apr 3, 2018Updated 8 years ago
- A DIY head tracker for 3D audio production☆19Mar 20, 2023Updated 3 years ago
- Editor for Wwise soundbank files. Feel free to use.☆15Jul 4, 2016Updated 9 years ago
- Prediction of sound event bounding boxes (SEBBs)☆36Aug 2, 2024Updated last year
- Ableton MIDI-Clip generation using GPT-4☆49Apr 14, 2026Updated last month
- SouPyX: An Audio Exploration Space.🪐☆42Nov 28, 2023Updated 2 years ago
- ☆23Feb 2, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Wwise text-to-speech integration using external editors.☆20Jun 27, 2025Updated 11 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- The accordion made of Ringcon with Unity, Wwise and the X360 controller emulator.☆21Apr 21, 2021Updated 5 years ago
- "An optimizer custom node for ComfyUI that ensures each queue execution starts in an optimal state by clearing unused VRAM and unnecessar…☆20Jul 18, 2025Updated 10 months ago
- Using Convolutional Neural Network to Generate Music☆11Nov 4, 2020Updated 5 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Wwise automatic import from file name using Wwise Authoring API.☆18Jan 18, 2019Updated 7 years ago
- A complete open source framework for live computation and broadcasting of audio descriptors, featuring a plugin, a standalone application…☆43Mar 11, 2026Updated 2 months ago
- A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming☆62Jun 30, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Event Relation in Text-to-Audio (TTA) Generation☆21Feb 26, 2025Updated last year
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- ☆24Aug 17, 2025Updated 9 months ago
- A polyphonic music transcription Vamp plugin☆10Nov 20, 2019Updated 6 years ago
- Pythonic interface to the EMC Unity REST API☆10Mar 9, 2022Updated 4 years ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 11 months ago
- ☆11Apr 30, 2022Updated 4 years ago
- ☆10Jun 15, 2022Updated 3 years ago
- ☆60Oct 22, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Gutter Synthesis: Max-based physical-ish synthesis with coupled Duffing oscillators☆55Aug 14, 2023Updated 2 years ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆112Dec 20, 2025Updated 5 months ago
- mp3 as VST-effect☆60Oct 20, 2024Updated last year
- beat swapping powered by AI☆14Jul 7, 2024Updated last year
- WisBlock API V2 for RAK4631 takes care of all the LoRaWAN, BLE, AT command functionality. It makes development of event driven power savi…☆19Oct 23, 2025Updated 7 months ago
- Audiokinetic Wwise WEM file converter☆22Jan 8, 2018Updated 8 years ago
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)☆14Aug 20, 2020Updated 5 years ago