Yuan-ManX/audio-ai-agent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yuan-ManX/audio-ai-agent)

Yuan-ManX / audio-ai-agent

Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.

☆16

Alternatives and similar repositories for audio-ai-agent

Users that are interested in audio-ai-agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ftshijt / Interspeech2024_DiscreteSpeechChallenge
View on GitHub
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Jan 26, 2024Updated 2 years ago
asigalov61 / Euterpe-X
View on GitHub
[DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…
☆33Nov 23, 2023Updated 2 years ago
szz1031 / Tool_ReplaceWwiseWavesWithP4
View on GitHub
A tool to easy reimport waves in Wwise project under P4V
☆11Dec 18, 2024Updated last year
zhaoyx239 / X-Translator
View on GitHub
☆26Jul 21, 2026Updated last week
sgmackie / Wwise_Plugins
View on GitHub
Various plugins created for Wwise
☆25Jul 15, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ex3ndr / supervoice-librilight-preprocessed
View on GitHub
60k hours of phoneme-aligned audio from audio books
☆19Jul 27, 2024Updated 2 years ago
superkittens / libBasicSOFA
View on GitHub
Basic library for spatial audio SOFA files
☆12Sep 29, 2020Updated 5 years ago
dstrub18 / visqol-rs
View on GitHub
The speech quality evalutator ViSQOL written in Rust
☆20Nov 5, 2025Updated 8 months ago
decasteljau / jsfxr-for-wwise
View on GitHub
jsfxr (ported from sfxr) with added Wwise connectivity, embedded into Electron
☆12Apr 3, 2018Updated 8 years ago
t1f7 / soundbank-editor
View on GitHub
Editor for Wwise soundbank files. Feel free to use.
☆15Jul 4, 2016Updated 10 years ago
notam02 / Teensy-Head-Tracker
View on GitHub
A DIY head tracker for 3D audio production
☆19Mar 20, 2023Updated 3 years ago
my-cloud / ruthenium
View on GitHub
Golang implementation of the Ruthenium protocol
☆11Dec 11, 2024Updated last year
Yuan-ManX / SouPyX
View on GitHub
SouPyX: An Audio Exploration Space.🪐
☆41Nov 28, 2023Updated 2 years ago
supersational / JAMMIN-GPT
View on GitHub
Ableton MIDI-Clip generation using GPT-4
☆51Apr 14, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
HelloZeroNet / ZeroNet-dist
View on GitHub
Binary distributions of ZeroNet
☆12Oct 25, 2019Updated 6 years ago
primepake / F5-TTS-meanflow-multilingual
View on GitHub
Meanflow and multilingual for F5-TTS model
☆16Aug 23, 2025Updated 11 months ago
near-ndc / i-am-human
View on GitHub
NEAR proof of concept for the proof of humanity protocol
☆13Mar 21, 2024Updated 2 years ago
preginald / qi-dao-optimizer
View on GitHub
I created this bot is so that I could sleep at night knowing that my vaults are earning the maximum Qi rewards.
☆11Mar 31, 2022Updated 4 years ago
ressium / Ringcordion
View on GitHub
The accordion made of Ringcon with Unity, Wwise and the X360 controller emulator.
☆21Apr 21, 2021Updated 5 years ago
zeyuxie29 / AudioTime
View on GitHub
☆39Jul 4, 2024Updated 2 years ago
Kickflip89 / Convolution-Music-AI
View on GitHub
Using Convolutional Neural Network to Generate Music
☆11Nov 4, 2020Updated 5 years ago
decasteljau / waapi-import-by-name
View on GitHub
Wwise automatic import from file name using Wwise Authoring API.
☆18Jan 18, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
enlomy / token-transfer-history
View on GitHub
token transfer history
☆13Jul 17, 2024Updated 2 years ago
mt-upc / ZeroSwot
View on GitHub
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25Dec 12, 2024Updated last year
intro2ddsp / intro2ddsp.github.io
View on GitHub
A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming
☆62Jun 30, 2025Updated last year
fresh-creations / tammy
View on GitHub
Generative AI for music videos
☆18May 28, 2023Updated 3 years ago
CCA-Lab / VocalStory
View on GitHub
☆16Jun 25, 2025Updated last year
Shy-98 / MELLE
View on GitHub
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41Jun 28, 2025Updated last year
yuhanghe01 / RiTTA
View on GitHub
Event Relation in Text-to-Audio (TTA) Generation
☆21Feb 26, 2025Updated last year
Panda-DAO / PandaDAO_Contract
View on GitHub
☆11Apr 30, 2022Updated 4 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
WGLabz / LoRaWAN-Energy-Meter
View on GitHub
A LoRaWAN enabled Energy monitoring device using ESP32 and PZEM004T Energy Monitoring Module.
☆12Mar 23, 2021Updated 5 years ago
LEMAS-Project / LEMAS-Edit
View on GitHub
LEMAS‑Edit is a multilingual speech editing system, supporting 10 languages: Chinese English Spanish Russian French German Italian Portug…
☆19Mar 31, 2026Updated 3 months ago
Crauzer / WEMSharp
View on GitHub
Audiokinetic Wwise WEM file converter
☆22Jan 8, 2018Updated 8 years ago
ftshijt / speech_evaluation
View on GitHub
A toolkit dedicate for speech evaluation.
☆23Sep 26, 2024Updated last year
jingzhunxue / FlowMirror_HydraVox
View on GitHub
FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…
☆49Feb 17, 2026Updated 5 months ago
kaistmm / AlignDiT
View on GitHub
[ACM MM 2025] AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation
☆24Oct 28, 2025Updated 9 months ago
decasteljau / waapi-text-to-speech
View on GitHub
Wwise text-to-speech integration using external editors.
☆20Jun 27, 2025Updated last year