Create training data for training a voice cloner for bark text to speech.
β47Jun 13, 2023Updated 3 years ago
Alternatives and similar repositories for bark-data-gen
Users that are interested in bark-data-gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code for the bark-voicecloning model. Training and inference.β711Sep 13, 2023Updated 2 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ21May 17, 2023Updated 3 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.β14Mar 22, 2023Updated 3 years ago
- β18Jan 20, 2025Updated last year
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separationβ24Aug 2, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Flow control nodes for comfyUI, allowing for more diverse workflowsβ13Apr 3, 2025Updated last year
- Audio generation using diffusion models, in PyTorch.β49Sep 28, 2023Updated 2 years ago
- Russian open TTS datasetβ17Nov 5, 2019Updated 6 years ago
- β63Jan 15, 2024Updated 2 years ago
- This suite of nodes unlocks high-performance parallel processing in ComfyUI by utilizing **Model Replication**. Unlike standard offloadinβ¦β55Feb 24, 2026Updated 4 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsβ16Jul 19, 2023Updated 2 years ago
- β12May 23, 2024Updated 2 years ago
- The demo page of UniAudioβ35Feb 5, 2024Updated 2 years ago
- Image inpainting system frontendβ14Jan 31, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionβ715Jan 19, 2025Updated last year
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorchβ2,620Jan 12, 2025Updated last year
- Google's TPGST reimplementation.β34Dec 11, 2019Updated 6 years ago
- High-performance Video Super Resolution for ComfyUI with VRAM optimization.β58Feb 13, 2026Updated 4 months ago
- MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]β67Jan 6, 2026Updated 5 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.β39Feb 24, 2023Updated 3 years ago
- Replicate Cog'ified MMAudioβ18Apr 2, 2025Updated last year
- Make-A-Video Latent Diffusion Modelβ19Nov 15, 2023Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ130May 31, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"β36Updated this week
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignmentβ68Jul 5, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β20Feb 27, 2024Updated 2 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bullβ13Oct 9, 2023Updated 2 years ago
- Let AI agents like ChatGPT & Claude use real-world local/remote tools you approve via browser extension + optional MCP serverβ26Sep 29, 2025Updated 9 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β262Jun 10, 2024Updated 2 years ago
- β21Jun 4, 2026Updated last month
- Web Audio API Node Editorβ16Mar 7, 2023Updated 3 years ago
- Remove generated stories with stray unicode charactersβ12Jan 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAEβ11Dec 31, 2016Updated 9 years ago
- β69May 19, 2023Updated 3 years ago
- Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorchβ1,332Sep 24, 2023Updated 2 years ago
- β71Jul 13, 2023Updated 2 years ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1β¦β32Jan 19, 2024Updated 2 years ago
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platformβ37Nov 15, 2022Updated 3 years ago
- Official source codes of airsepβ39Mar 26, 2024Updated 2 years ago