The demo page of UniAudio
☆35Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for UniAudio_demo
Users that are interested in UniAudio_demo are comparing it to the libraries listed below
Sorting:
- The Open Source Code of UniAudio☆605Jul 22, 2024Updated last year
- Colab notebook to finetune GLIDE.☆12Mar 22, 2022Updated 3 years ago
- Mustango: Toward Controllable Text-to-Music Generation☆386Jun 2, 2025Updated 9 months ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆14Jul 25, 2023Updated 2 years ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Nov 26, 2022Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- ☆65Nov 4, 2021Updated 4 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Oct 10, 2024Updated last year
- ☆13May 18, 2023Updated 2 years ago
- Continuous descriptor-based control for deep audio synthesis☆23Aug 4, 2023Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆48Jun 13, 2023Updated 2 years ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Feb 24, 2026Updated last week
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- PyTorch implementation of paper "Flat Metric Minimization with Applications in Generative Modeling"☆19May 14, 2019Updated 6 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- AI Image Generation Discord Bot @ pollinations.ai 🌸 Used in 1.5k+ servers☆30Feb 1, 2026Updated last month
- A CLIP conditioned Decision Transformer.☆22Jul 14, 2021Updated 4 years ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Jul 27, 2022Updated 3 years ago
- ☆24Sep 27, 2022Updated 3 years ago
- [CVPRW'23] The official PyTorch implementation of NamedMask☆23Jun 12, 2023Updated 2 years ago
- ☆27Dec 13, 2024Updated last year
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Aug 20, 2020Updated 5 years ago
- Score- and Lyrics-Free Singing Voice Generation☆28May 25, 2020Updated 5 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆25Jul 2, 2024Updated last year
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 2 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- The latent diffusion model for text-to-music generation.☆185Jan 26, 2024Updated 2 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- (NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"☆31Nov 21, 2021Updated 4 years ago
- ☆160Jun 13, 2022Updated 3 years ago
- Official Source code of "One-Shot Adaptation of GAN in Just One CLIP" IEEE Transactions on Pattern Anaylsis and Machine Intelligence (TPA…☆66Jun 5, 2023Updated 2 years ago
- Become an AI on Roblox.☆12Jan 3, 2026Updated 2 months ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆193Mar 25, 2024Updated last year
- Demo for 2022 Interspeech☆29Jun 14, 2022Updated 3 years ago
- An unofficial implementation of the paper titled "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network".☆27Apr 17, 2020Updated 5 years ago
- Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt☆140Jan 3, 2024Updated 2 years ago
- WavJourney: Compositional Audio Creation with LLMs☆540Sep 28, 2023Updated 2 years ago