yangdongchao/UniAudio_demo

The demo page of UniAudio

☆35

Alternatives and similar repositories for UniAudio_demo

Users who are interested in UniAudio_demo are comparing it to the repositories listed below.

yangdongchao / UniAudio
The Open Source Code of UniAudio
☆605 · Jul 22, 2024 · Updated 2 years ago

juhayna-zh / BSRNN-speech-preprocess
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15 · Aug 22, 2023 · Updated 2 years ago

AMAAI-Lab / mustango
Mustango: Toward Controllable Text-to-Music Generation
☆394 · Jun 2, 2025 · Updated last year

glory20h / VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
☆194 · Aug 9, 2024 · Updated last year

AMAAI-Lab / SonicVerse
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
☆53 · Jul 28, 2025 · Updated last year

RetroCirce / Choral_Music_Separation
Chorale Music Separation Dataset and Model Framework
☆41 · Dec 5, 2022 · Updated 3 years ago

neuroidss / audiocraft_neurofeedback
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆20 · Feb 27, 2024 · Updated 2 years ago

LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆23 · Oct 10, 2024 · Updated last year

RMSnow / HAT
Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.
☆14 · Mar 22, 2023 · Updated 3 years ago

teragonaudio / Convolver
UNMAINTAINED PROJECT
☆14 · May 26, 2014 · Updated 12 years ago

SlugLab / wss-ebpf
eBPF version of https://github.com/brendangregg/wss
☆11 · Jan 26, 2023 · Updated 3 years ago

gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆47 · Jun 13, 2023 · Updated 3 years ago

ciaua / score_lyrics_free_svg
Score- and Lyrics-Free Singing Voice Generation
☆28 · May 25, 2020 · Updated 6 years ago

Narsil / hf-chat
☆25 · Dec 13, 2024 · Updated last year

rishikksh20 / NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25 · Jul 5, 2022 · Updated 4 years ago

eliohead / glide-finetune-colab
Colab notebook to finetune GLIDE.
☆12 · Mar 22, 2022 · Updated 4 years ago

erl-j / control-synthesis
☆15 · Sep 8, 2021 · Updated 4 years ago

mehdidc / clip_rerank
Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.
☆15 · May 3, 2021 · Updated 5 years ago

tomasJwYU / AutoPrepDemo
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
☆36 · Dec 31, 2023 · Updated 2 years ago

adelacvg / ttts
Train the next generation of TTS systems.
☆169 · Sep 13, 2024 · Updated last year

dmarx / Multi-Modal-Comparators
Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP
☆39 · Nov 26, 2022 · Updated 3 years ago

RetroCirce / MusicLDM
The latent diffusion model for text-to-music generation.
☆188 · Jan 26, 2024 · Updated 2 years ago

lucidrains / CLAP
Contrastive Language-Audio Pretraining
☆15 · May 18, 2021 · Updated 5 years ago

neurorave / neurorave
Continuous descriptor-based control for deep audio synthesis
☆23 · Aug 4, 2023 · Updated 2 years ago

MiniXC / phones
A collection of utilities for handling IPA phones.
☆27 · Sep 24, 2023 · Updated 2 years ago

catalpaaa / Mamba-4chan
☆18 · Apr 8, 2025 · Updated last year

deezer / MultilingualMusicGenreEmbedding
Python code to reproduce the experiments presented in the paper Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Ite…
☆12 · Nov 13, 2020 · Updated 5 years ago

SarthakYadav / audiomae-plusplus-official
Official repository for the paper "AudioMAE++: learning better masked audio representations with SwiGLU FFNs"
☆15 · Apr 30, 2026 · Updated 2 months ago

itec-hust / MusicYOLO
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆18 · Jan 29, 2022 · Updated 4 years ago

haoheliu / AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
☆304 · Dec 13, 2024 · Updated last year

revospeech / audio-generation-papers
recent audio generation papers (including speech, music and general audios)
☆13 · Mar 14, 2023 · Updated 3 years ago

AranKomat / Diff-DALLE
☆65 · Nov 4, 2021 · Updated 4 years ago

happylittlecat2333 / Auffusion
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆194 · Mar 25, 2024 · Updated 2 years ago

jags111 / floral-diffusion
Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version
☆26 · Jul 27, 2022 · Updated 4 years ago

Audio-AGI / WavJourney
WavJourney: Compositional Audio Creation with LLMs
☆544 · Sep 28, 2023 · Updated 2 years ago

moellenh / flatgan
PyTorch implementation of paper "Flat Metric Minimization with Applications in Generative Modeling"
☆19 · May 14, 2019 · Updated 7 years ago

PINTO0309 / sne4onnx
A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…
☆17 · Feb 24, 2026 · Updated 5 months ago

hs-oh-prml / DurFlexEVC
☆82 · Jan 22, 2025 · Updated last year

nii-yamagishilab / midi-to-audio
Project for MIDI to Audio Synthesis
☆28 · Mar 13, 2023 · Updated 3 years ago