Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for audio-ai-agent
Users that are interested in audio-ai-agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Jan 26, 2024Updated 2 years ago
- A tool to easy reimport waves in Wwise project under P4V☆11Dec 18, 2024Updated last year
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Basic library for spatial audio SOFA files☆12Sep 29, 2020Updated 5 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Nov 23, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- Various plugins created for Wwise☆25Jul 15, 2019Updated 6 years ago
- A DIY head tracker for 3D audio production☆18Mar 20, 2023Updated 3 years ago
- jsfxr (ported from sfxr) with added Wwise connectivity, embedded into Electron☆12Apr 3, 2018Updated 7 years ago
- Ableton MIDI-Clip generation using GPT-4☆47Jan 14, 2026Updated 2 months ago
- ☆38Jul 4, 2024Updated last year
- SouPyX: An Audio Exploration Space.🪐☆42Nov 28, 2023Updated 2 years ago
- Wwise text-to-speech integration using external editors.☆20Jun 27, 2025Updated 9 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The accordion made of Ringcon with Unity, Wwise and the X360 controller emulator.☆21Apr 21, 2021Updated 4 years ago
- "An optimizer custom node for ComfyUI that ensures each queue execution starts in an optimal state by clearing unused VRAM and unnecessar…☆17Jul 18, 2025Updated 8 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A complete open source framework for live computation and broadcasting of audio descriptors, featuring a plugin, a standalone application…☆42Mar 11, 2026Updated 2 weeks ago
- Wwise automatic import from file name using Wwise Authoring API.☆17Jan 18, 2019Updated 7 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated last year
- A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming☆62Jun 30, 2025Updated 8 months ago
- A polyphonic music transcription Vamp plugin☆10Nov 20, 2019Updated 6 years ago
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆24Aug 17, 2025Updated 7 months ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 9 months ago
- Gutter Synthesis: Max-based physical-ish synthesis with coupled Duffing oscillators☆56Aug 14, 2023Updated 2 years ago
- mp3 as VST-effect☆59Oct 20, 2024Updated last year
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)☆14Aug 20, 2020Updated 5 years ago
- Standalone real time dynamic vocal harmonizer☆25Nov 28, 2023Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- A REAPER extension for transferring audio files and their corresponding object hierarchies to Wwise projects, streamlining the sound desi…☆50Sep 26, 2025Updated 6 months ago
- Automatic Remix with AI and audio source separation☆11Dec 11, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- VRChat OSC Library for .NET Core 6, C#☆14Nov 17, 2022Updated 3 years ago
- Non Destructive Extensions For VRChat Avatars (built on top of NDMF)☆20Jan 27, 2026Updated 2 months ago
- Musical Agent based on Self-Organizing Maps☆23Feb 6, 2023Updated 3 years ago
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 9 months ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆131Sep 2, 2025Updated 6 months ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- ☆21Nov 19, 2021Updated 4 years ago