Create training data for a voice cloner for Bark text-to-speech.
☆47 · Jun 13, 2023 · Updated 2 years ago
Alternatives and similar repositories for bark-data-gen
Users that are interested in bark-data-gen are comparing it to the libraries listed below.
- The code for the bark-voicecloning model. Training and inference. ☆711 · Sep 13, 2023 · Updated 2 years ago
- Text-prompted Generative Audio Model - With the ability to clone voices ☆21 · May 17, 2023 · Updated 2 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022. ☆14 · Mar 22, 2023 · Updated 3 years ago
- ☆18 · Jan 20, 2025 · Updated last year
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation ☆24 · Aug 2, 2023 · Updated 2 years ago
- Flow control nodes for ComfyUI, allowing for more diverse workflows ☆13 · Apr 3, 2025 · Updated last year
- Audio generation using diffusion models, in PyTorch. ☆49 · Sep 28, 2023 · Updated 2 years ago
- This suite of nodes unlocks high-performance parallel processing in ComfyUI by utilizing **Model Replication**. Unlike standard offloadin… ☆52 · Feb 24, 2026 · Updated 2 months ago
- ☆12 · May 23, 2024 · Updated last year
- Codebase and project page for EDMSound ☆35 · Nov 20, 2023 · Updated 2 years ago
- Image inpainting system frontend ☆14 · Jan 31, 2023 · Updated 3 years ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ☆713 · Jan 19, 2025 · Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic… ☆57 · Aug 15, 2025 · Updated 8 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch ☆2,619 · Jan 12, 2025 · Updated last year
- Site for sharing Bark voices ☆50 · Mar 25, 2025 · Updated last year
- High-performance Video Super Resolution for ComfyUI with VRAM optimization. ☆53 · Feb 13, 2026 · Updated 2 months ago
- Google's TPGST reimplementation. ☆34 · Dec 11, 2019 · Updated 6 years ago
- MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025] ☆61 · Jan 6, 2026 · Updated 3 months ago
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger. ☆38 · Feb 24, 2023 · Updated 3 years ago
- Replicate Cog'ified MMAudio ☆18 · Apr 2, 2025 · Updated last year
- Make-A-Video Latent Diffusion Model ☆19 · Nov 15, 2023 · Updated 2 years ago
- Barkify: an unofficial training implementation of Bark TTS by suno-ai ☆130 · May 31, 2023 · Updated 2 years ago
- Codebase for ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval" ☆36 · Feb 10, 2026 · Updated 2 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆68 · Jul 5, 2024 · Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull ☆13 · Oct 9, 2023 · Updated 2 years ago
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ☆260 · Jun 10, 2024 · Updated last year
- Voice Conversion method based on speaker style ☆14 · Aug 7, 2021 · Updated 4 years ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS ☆58 · Dec 1, 2023 · Updated 2 years ago
- Remove generated stories with stray unicode characters ☆12 · Jan 3, 2024 · Updated 2 years ago
- ☆69 · May 19, 2023 · Updated 2 years ago
- Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch ☆1,334 · Sep 24, 2023 · Updated 2 years ago
- ☆71 · Jul 13, 2023 · Updated 2 years ago
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platform ☆37 · Nov 15, 2022 · Updated 3 years ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1… ☆32 · Jan 19, 2024 · Updated 2 years ago
- Official source code of airsep ☆39 · Mar 26, 2024 · Updated 2 years ago
- Voice data <= 10 mins can also be used to train a good VC model! ☆12 · Dec 5, 2023 · Updated 2 years ago
- Deep Learning technology to upscale music. ☆23 · Jun 17, 2020 · Updated 5 years ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform ☆251 · Jan 14, 2025 · Updated last year
- ☆19 · Sep 4, 2024 · Updated last year