☆483Jun 9, 2026Updated last week
Alternatives and similar repositories for stable-audio-3
Users that are interested in stable-audio-3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 6 months ago
- ZePolA - A Parametric Equalizer with Interactive Poles and Zeros Control for Digital Signal Processing Education☆26Dec 19, 2025Updated 5 months ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆53May 1, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- A detector for Meta Ray-Ban glasses☆43Nov 18, 2025Updated 6 months ago
- Encode and decode audio samples to/from compressed latent representations!☆260Sep 19, 2025Updated 8 months ago
- Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".☆31Jul 22, 2020Updated 5 years ago
- Ming-omni-tts: Simple and Efficient Unified Generation of Speech, Music, and Sound with Precise Control☆240Feb 26, 2026Updated 3 months ago
- Artificial intelligence bot for live voice improvisation☆30May 22, 2019Updated 7 years ago
- Latent Space Sound Design Tool based on the VAE of stable-audio-open☆15Aug 23, 2024Updated last year
- ☆47Mar 29, 2026Updated 2 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆210Jun 5, 2026Updated last week
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 3 years ago
- ☆47Oct 29, 2025Updated 7 months ago
- ☆89Dec 31, 2025Updated 5 months ago
- Launch your speech synthesis within one minute.☆12May 6, 2024Updated 2 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- ☆11Apr 16, 2023Updated 3 years ago
- A script that brute forces the generation of chord progressions using major and minor triads☆13Feb 20, 2020Updated 6 years ago
- [NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆220Dec 9, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ML Audio plug-in example using iPlug2 & ONNX Runtime☆37Dec 2, 2023Updated 2 years ago
- singing voice conversion based on glow-tts☆12Aug 20, 2023Updated 2 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆29Feb 11, 2026Updated 4 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆34Jun 1, 2023Updated 3 years ago
- MultiModal Audio Generation in Raw Waveform Space.☆153May 26, 2026Updated 3 weeks ago
- ☆24May 28, 2025Updated last year
- ☆31Nov 5, 2023Updated 2 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Aug 10, 2017Updated 8 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆23Feb 26, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆45Oct 28, 2024Updated last year
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆122Mar 14, 2023Updated 3 years ago
- Official implementation of YingMusic-SVC.☆144Dec 29, 2025Updated 5 months ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated last year
- A transformer that decodes swipes across a smartphone keyboard into words (gesture / swipe / glide typing) (enhanced yandex cup solution)☆15Feb 20, 2026Updated 3 months ago
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)☆28Mar 22, 2025Updated last year
- poorman's ar-dit tts☆45Dec 31, 2025Updated 5 months ago