The demo page of UniAudio
☆35Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for UniAudio_demo
Users that are interested in UniAudio_demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Open Source Code of UniAudio☆605Jul 22, 2024Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Mustango: Toward Controllable Text-to-Music Generation☆391Jun 2, 2025Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆14Mar 22, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆20Feb 27, 2024Updated 2 years ago
- Chorale Music Separation Dataset and Model Framework☆41Dec 5, 2022Updated 3 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆24Oct 10, 2024Updated last year
- UNMAINTAINED PROJECT☆14May 26, 2014Updated 12 years ago
- Create training data for training a voice cloner for bark text to speech.☆47Jun 13, 2023Updated 3 years ago
- Score- and Lyrics-Free Singing Voice Generation☆28May 25, 2020Updated 6 years ago
- eBPF version of https://github.com/brendangregg/wss☆11Jan 26, 2023Updated 3 years ago
- Colab notebook to finetune GLIDE.☆12Mar 22, 2022Updated 4 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15May 3, 2021Updated 5 years ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Nov 26, 2022Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 5 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆36Dec 31, 2023Updated 2 years ago
- The latent diffusion model for text-to-music generation.☆187Jan 26, 2024Updated 2 years ago
- Train the next generation of TTS systems.☆170Sep 13, 2024Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- Continuous descriptor-based control for deep audio synthesis☆23Aug 4, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆18Jan 29, 2022Updated 4 years ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆29Nov 18, 2025Updated 6 months ago
- AudioLDM training, finetuning, evaluation and inference.☆303Dec 13, 2024Updated last year
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Jul 27, 2022Updated 3 years ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆193Mar 25, 2024Updated 2 years ago
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆63Jul 28, 2025Updated 10 months ago
- ☆65Nov 4, 2021Updated 4 years ago
- WavJourney: Compositional Audio Creation with LLMs☆542Sep 28, 2023Updated 2 years ago
- ☆81Jan 22, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🎼 text-to-video system for music visualization☆57Feb 15, 2024Updated 2 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆15Jul 25, 2023Updated 2 years ago
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)☆38Jul 24, 2025Updated 10 months ago
- ☆60Dec 24, 2025Updated 5 months ago
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 3 years ago
- Python code to reproduce the experiments presented in the paper Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Ite…☆12Nov 13, 2020Updated 5 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆11Jun 12, 2023Updated 3 years ago