The demo page of UniAudio
☆35Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for UniAudio_demo
Users that are interested in UniAudio_demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Open Source Code of UniAudio☆604Jul 22, 2024Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Mustango: Toward Controllable Text-to-Music Generation☆385Jun 2, 2025Updated 10 months ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆14Mar 22, 2023Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆20Feb 27, 2024Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆24Oct 10, 2024Updated last year
- UNMAINTAINED PROJECT☆13May 26, 2014Updated 11 years ago
- Create training data for training a voice cloner for bark text to speech.☆48Jun 13, 2023Updated 2 years ago
- Score- and Lyrics-Free Singing Voice Generation☆28May 25, 2020Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15May 3, 2021Updated 4 years ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Nov 26, 2022Updated 3 years ago
- Continuous descriptor-based control for deep audio synthesis☆23Aug 4, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 4 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆35Dec 31, 2023Updated 2 years ago
- The latent diffusion model for text-to-music generation.☆186Jan 26, 2024Updated 2 years ago
- Train the next generation of TTS systems.☆170Sep 13, 2024Updated last year
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆16Jan 29, 2022Updated 4 years ago
- AudioLDM training, finetuning, evaluation and inference.☆298Dec 13, 2024Updated last year
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Jul 27, 2022Updated 3 years ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆193Mar 25, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- recent audio generation papers (including speech, music and general audios)☆13Mar 14, 2023Updated 3 years ago
- ☆65Nov 4, 2021Updated 4 years ago
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆62Jul 28, 2025Updated 8 months ago
- WavJourney: Compositional Audio Creation with LLMs☆540Sep 28, 2023Updated 2 years ago
- ☆82Jan 22, 2025Updated last year
- 🎼 text-to-video system for music visualization☆57Feb 15, 2024Updated 2 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆14Jul 25, 2023Updated 2 years ago
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)☆37Jul 24, 2025Updated 8 months ago
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆54Feb 12, 2026Updated 2 months ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆11Jun 12, 2023Updated 2 years ago
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆16Sep 21, 2023Updated 2 years ago
- My implementation of diffusion (like) models☆11Apr 14, 2023Updated 2 years ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated last year
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Aug 20, 2020Updated 5 years ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆29Dec 19, 2024Updated last year