yangdongchao / UniAudio_demoView external linksLinks
The demo page of UniAudio
☆34Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for UniAudio_demo
Users that are interested in UniAudio_demo are comparing it to the libraries listed below
Sorting:
- The Open Source Code of UniAudio☆598Jul 22, 2024Updated last year
- Colab notebook to finetune GLIDE.☆12Mar 22, 2022Updated 3 years ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15May 3, 2021Updated 4 years ago
- Mustango: Toward Controllable Text-to-Music Generation☆388Jun 2, 2025Updated 8 months ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆13Jul 25, 2023Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 4 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆20Feb 27, 2024Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Nov 26, 2022Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- ☆15Sep 8, 2021Updated 4 years ago
- ☆17Feb 20, 2023Updated 2 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- ☆18Apr 8, 2025Updated 10 months ago
- ☆13May 18, 2023Updated 2 years ago
- Continuous descriptor-based control for deep audio synthesis☆23Aug 4, 2023Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆48Jun 13, 2023Updated 2 years ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Feb 6, 2026Updated last week
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- PyTorch implementation of paper "Flat Metric Minimization with Applications in Generative Modeling"☆19May 14, 2019Updated 6 years ago
- VoiceLDM: Text-to-Speech with Environmental Context☆191Aug 9, 2024Updated last year
- A CLIP conditioned Decision Transformer.☆22Jul 14, 2021Updated 4 years ago
- checkpoints for glide finetuned on laion and other datasets. wip.☆50Aug 17, 2022Updated 3 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆25Jul 2, 2024Updated last year
- ☆160Jun 13, 2022Updated 3 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- Official Source code of "One-Shot Adaptation of GAN in Just One CLIP" IEEE Transactions on Pattern Anaylsis and Machine Intelligence (TPA…☆65Jun 5, 2023Updated 2 years ago
- ☆27Jul 25, 2023Updated 2 years ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆193Mar 25, 2024Updated last year
- An unofficial implementation of the paper titled "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network".☆27Apr 17, 2020Updated 5 years ago
- Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt☆140Jan 3, 2024Updated 2 years ago
- ☆82Jan 22, 2025Updated last year
- WavJourney: Compositional Audio Creation with LLMs☆541Sep 28, 2023Updated 2 years ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.☆39Jul 8, 2024Updated last year
- ☆36Sep 20, 2022Updated 3 years ago
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- Contrastive Language-Image Pretraining☆144Sep 6, 2022Updated 3 years ago
- Tool which helps you to create clear mobile architecture in React.js☆14Nov 27, 2018Updated 7 years ago
- ☆14Updated this week
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Jan 15, 2021Updated 5 years ago