gitmylo/bark-data-gen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gitmylo/bark-data-gen)

gitmylo / bark-data-gen

Create training data for training a voice cloner for bark text to speech.

☆47

Alternatives and similar repositories for bark-data-gen

Users that are interested in bark-data-gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gitmylo / bark-voice-cloning-HuBERT-quantizer
View on GitHub
The code for the bark-voicecloning model. Training and inference.
☆711Sep 13, 2023Updated 2 years ago
EndlessReform / bark-with-voice-clone
View on GitHub
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
☆21May 17, 2023Updated 3 years ago
sakemin / demucs_batch-multigpu
View on GitHub
[Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation
☆24Aug 2, 2023Updated 2 years ago
gitmylo / FlowNodes
View on GitHub
Flow control nodes for comfyUI, allowing for more diverse workflows
☆13Apr 3, 2025Updated last year
v-nhandt21 / ViMFA
View on GitHub
Montreal Forced Aligner for Vietnamese
☆15Oct 23, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Harmonai-org / audio-diffusion-pytorch-fork
View on GitHub
Audio generation using diffusion models, in PyTorch.
☆49Sep 28, 2023Updated 2 years ago
AgentCooper2002 / EDMSound
View on GitHub
Codebase and project page for EDMSound
☆35Nov 20, 2023Updated 2 years ago
hollygrimm / voice-dataset-creation
View on GitHub
Community-controlled voice data collection for language preservation and AI development. Companion to 'AI Techniques for Indigenous Cultu…
☆71May 6, 2026Updated 2 months ago
cpdu / unicats
View on GitHub
☆63Jan 15, 2024Updated 2 years ago
yangdongchao / UniAudio_demo
View on GitHub
The demo page of UniAudio
☆35Feb 5, 2024Updated 2 years ago
chentuochao / Target-Conversation-Extraction
View on GitHub
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…
☆58Aug 15, 2025Updated 11 months ago
OlaWod / FreeVC
View on GitHub
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
☆714Jan 19, 2025Updated last year
rsxdalv / bark-speaker-directory
View on GitHub
Site for sharing Bark voices
☆50Jul 6, 2026Updated 2 weeks ago
lucidrains / audiolm-pytorch
View on GitHub
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
☆2,622Jan 12, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CaptainGrock / ComfyUIInvisibleWatermark
View on GitHub
☆13May 23, 2024Updated 2 years ago
Russellwzr / image-inpainting-fe
View on GitHub
Image inpainting system frontend
☆14Jan 31, 2023Updated 3 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
jerryuhoo / VISinger
View on GitHub
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
☆39Feb 24, 2023Updated 3 years ago
neonbjb / DL-Art-School
View on GitHub
DLAS - A configuration-driven trainer for generative models
☆142Oct 11, 2022Updated 3 years ago
RickyL-2000 / AlignSTS
View on GitHub
Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment
☆68Jul 5, 2024Updated 2 years ago
ishine / open_tts
View on GitHub
Russian open TTS dataset
☆17Nov 5, 2019Updated 6 years ago
anyvoiceai / Barkify
View on GitHub
Barkify: an unoffical training implementation of Bark TTS by suno-ai
☆130May 31, 2023Updated 3 years ago
neuroidss / audiocraft_neurofeedback
View on GitHub
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆20Feb 27, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
madhavlab / wav2tok
View on GitHub
Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36Jun 30, 2026Updated 3 weeks ago
zsxkib / ST-MFNet
View on GitHub
[IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
☆13Oct 9, 2023Updated 2 years ago
davidmartinrius / speech-dataset-generator
View on GitHub
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆262Jun 10, 2024Updated 2 years ago
zcf28 / StyleGAN-VC
View on GitHub
Voice Conversion method based on speaker style
☆14Aug 7, 2021Updated 4 years ago
v-nhandt21 / ViSV2TTS
View on GitHub
Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS
☆56Dec 1, 2023Updated 2 years ago
Volcomix / waane
View on GitHub
Web Audio API Node Editor
☆16Mar 7, 2023Updated 3 years ago
RuiShu / one-bit-vae
View on GitHub
A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAE
☆11Dec 31, 2016Updated 9 years ago
ad8e / TinyStories-cleaner
View on GitHub
Remove generated stories with stray unicode characters
☆12Jan 3, 2024Updated 2 years ago
Kikyo-16 / airgen
View on GitHub
Official source codes of airsep
☆39Mar 26, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
lucidrains / naturalspeech2-pytorch
View on GitHub
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
☆1,333Sep 24, 2023Updated 2 years ago
0417keito / JEN-1-COMPOSER-pytorch
View on GitHub
Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…
☆32Jan 19, 2024Updated 2 years ago
zsxkib / voice-cloning-training
View on GitHub
Voice data <= 10 mins can also be used to train a good VC model!
☆12Dec 5, 2023Updated 2 years ago
lucataco / serverless-template-dreambooth-training
View on GitHub
Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platform
☆37Nov 15, 2022Updated 3 years ago
lifeiteng / SoundStorm
View on GitHub
☆71Jul 13, 2023Updated 3 years ago
bshall / VectorQuantizedCPC
View on GitHub
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142Sep 1, 2020Updated 5 years ago
rishikksh20 / NaturalSpeech2
View on GitHub
☆69May 19, 2023Updated 3 years ago