Create training data for training a voice cloner for bark text to speech.
β47Jun 13, 2023Updated 2 years ago
Alternatives and similar repositories for bark-data-gen
Users that are interested in bark-data-gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code for the bark-voicecloning model. Training and inference.β710Sep 13, 2023Updated 2 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ21May 17, 2023Updated 3 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.β14Mar 22, 2023Updated 3 years ago
- β18Jan 20, 2025Updated last year
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separationβ24Aug 2, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Flow control nodes for comfyUI, allowing for more diverse workflowsβ13Apr 3, 2025Updated last year
- Audio generation using diffusion models, in PyTorch.β49Sep 28, 2023Updated 2 years ago
- Community-controlled voice data collection for language preservation and AI development. Companion to 'AI Techniques for Indigenous Cultuβ¦β71May 6, 2026Updated 2 weeks ago
- β63Jan 15, 2024Updated 2 years ago
- This suite of nodes unlocks high-performance parallel processing in ComfyUI by utilizing **Model Replication**. Unlike standard offloadinβ¦β53Feb 24, 2026Updated 3 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsβ16Jul 19, 2023Updated 2 years ago
- β12May 23, 2024Updated 2 years ago
- Codebase and project page for EDMSoundβ35Nov 20, 2023Updated 2 years ago
- Image inpainting system frontendβ14Jan 31, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionβ715Jan 19, 2025Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamicβ¦β58Aug 15, 2025Updated 9 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorchβ2,620Jan 12, 2025Updated last year
- Site for sharing Bark voicesβ50Mar 25, 2025Updated last year
- Google's TPGST reimplementation.β34Dec 11, 2019Updated 6 years ago
- MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]β64Jan 6, 2026Updated 4 months ago
- DLAS - A configuration-driven trainer for generative modelsβ142Oct 11, 2022Updated 3 years ago
- Replicate Cog'ified MMAudioβ18Apr 2, 2025Updated last year
- Make-A-Video Latent Diffusion Modelβ19Nov 15, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"β36Feb 10, 2026Updated 3 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignmentβ68Jul 5, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β20Feb 27, 2024Updated 2 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bullβ13Oct 9, 2023Updated 2 years ago
- Voice Conversion method based on speaker styleβ14Aug 7, 2021Updated 4 years ago
- β21May 7, 2026Updated 2 weeks ago
- Web Audio API Node Editorβ16Mar 7, 2023Updated 3 years ago
- Remove generated stories with stray unicode charactersβ12Jan 3, 2024Updated 2 years ago
- β69May 19, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β71Jul 13, 2023Updated 2 years ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1β¦β32Jan 19, 2024Updated 2 years ago
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platformβ37Nov 15, 2022Updated 3 years ago
- Official source codes of airsepβ39Mar 26, 2024Updated 2 years ago
- Voice data <= 10 mins can also be used to train a good VC model!β12Dec 5, 2023Updated 2 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversionβ143Sep 1, 2020Updated 5 years ago
- Deep Learning technology to upscale music.β23Jun 17, 2020Updated 5 years ago