Create training data for training a voice cloner for bark text to speech.
β48Jun 13, 2023Updated 2 years ago
Alternatives and similar repositories for bark-data-gen
Users that are interested in bark-data-gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code for the bark-voicecloning model. Training and inference.β711Sep 13, 2023Updated 2 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ21May 17, 2023Updated 2 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.β15Mar 22, 2023Updated 3 years ago
- β18Jan 20, 2025Updated last year
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separationβ24Aug 2, 2023Updated 2 years ago
- This suite of nodes unlocks high-performance parallel processing in ComfyUI by utilizing **Model Replication**. Unlike standard offloadinβ¦β42Feb 24, 2026Updated last month
- Flow control nodes for comfyUI, allowing for more diverse workflowsβ13Apr 3, 2025Updated 11 months ago
- Audio generation using diffusion models, in PyTorch.β49Sep 28, 2023Updated 2 years ago
- Tools to create your own voice dataset for TTS trainingβ71Oct 26, 2020Updated 5 years ago
- β64Jan 15, 2024Updated 2 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsβ16Jul 19, 2023Updated 2 years ago
- β12May 23, 2024Updated last year
- Codebase and project page for EDMSoundβ35Nov 20, 2023Updated 2 years ago
- The demo page of UniAudioβ35Feb 5, 2024Updated 2 years ago
- Image inpainting system frontendβ13Jan 31, 2023Updated 3 years ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionβ707Jan 19, 2025Updated last year
- High-performance Video Super Resolution for ComfyUI with VRAM optimization.β45Feb 13, 2026Updated last month
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamicβ¦β56Aug 15, 2025Updated 7 months ago
- Site for sharing Bark voicesβ51Mar 25, 2025Updated 11 months ago
- MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]β59Jan 6, 2026Updated 2 months ago
- Google's TPGST reimplementation.β34Dec 11, 2019Updated 6 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.β37Feb 24, 2023Updated 3 years ago
- DLAS - A configuration-driven trainer for generative modelsβ142Oct 11, 2022Updated 3 years ago
- Replicate Cog'ified MMAudioβ18Apr 2, 2025Updated 11 months ago
- Make-A-Video Latent Diffusion Modelβ19Nov 15, 2023Updated 2 years ago
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"β36Feb 10, 2026Updated last month
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ130May 31, 2023Updated 2 years ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignmentβ68Jul 5, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β20Feb 27, 2024Updated 2 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bullβ13Oct 9, 2023Updated 2 years ago
- Let AI agents like ChatGPT & Claude use real-world local/remote tools you approve via browser extension + optional MCP serverβ21Sep 29, 2025Updated 5 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β257Jun 10, 2024Updated last year
- Voice Conversion method based on speaker styleβ14Aug 7, 2021Updated 4 years ago
- β18May 27, 2025Updated 9 months ago
- Web Audio API Node Editorβ16Mar 7, 2023Updated 3 years ago
- Remove generated stories with stray unicode charactersβ12Jan 3, 2024Updated 2 years ago
- A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAEβ11Dec 31, 2016Updated 9 years ago
- β69May 19, 2023Updated 2 years ago
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platformβ37Nov 15, 2022Updated 3 years ago