Create training data for training a voice cloner for bark text to speech.
β48Jun 13, 2023Updated 2 years ago
Alternatives and similar repositories for bark-data-gen
Users that are interested in bark-data-gen are comparing it to the libraries listed below
Sorting:
- The code for the bark-voicecloning model. Training and inference.β710Sep 13, 2023Updated 2 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ20May 17, 2023Updated 2 years ago
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separationβ24Aug 2, 2023Updated 2 years ago
- Flow control nodes for comfyUI, allowing for more diverse workflowsβ12Apr 3, 2025Updated 11 months ago
- Replicate Cog'ified MMAudioβ17Apr 2, 2025Updated 11 months ago
- Voice data <= 10 mins can also be used to train a good VC model!β12Dec 5, 2023Updated 2 years ago
- Image inpainting system frontendβ13Jan 31, 2023Updated 3 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bullβ13Oct 9, 2023Updated 2 years ago
- Site for sharing Bark voicesβ51Mar 25, 2025Updated 11 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β20Feb 27, 2024Updated 2 years ago
- β17Jan 20, 2025Updated last year
- Everybody Compose: Deep Beats To Musicβ12Apr 12, 2023Updated 2 years ago
- β18May 27, 2025Updated 9 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsβ16Jul 19, 2023Updated 2 years ago
- Remove generated stories with stray unicode charactersβ12Jan 3, 2024Updated 2 years ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionβ706Jan 19, 2025Updated last year
- Tools to create your own voice dataset for TTS trainingβ70Oct 26, 2020Updated 5 years ago
- β11Sep 12, 2025Updated 5 months ago
- Voice Conversion method based on speaker styleβ14Aug 7, 2021Updated 4 years ago
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platformβ37Nov 15, 2022Updated 3 years ago
- Web Audio API Node Editorβ16Mar 7, 2023Updated 2 years ago
- EDL composer can be used to create Edit Decision List. EDL, Edit Decision List, is a plain text format that describes a video sequence. β¦β20Mar 4, 2023Updated 3 years ago
- Make-A-Video Latent Diffusion Modelβ19Nov 15, 2023Updated 2 years ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-velβ¦β87Dec 22, 2024Updated last year
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β258Jun 10, 2024Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ130May 31, 2023Updated 2 years ago
- A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the poβ¦β21Dec 20, 2024Updated last year
- DLAS - A configuration-driven trainer for generative modelsβ142Oct 11, 2022Updated 3 years ago
- Deep Learning technology to upscale music.β23Jun 17, 2020Updated 5 years ago
- This is a cog implementation of the fine-tuner for Meta's MusicGenβ54Apr 5, 2024Updated last year
- A unified model for zero-shot singing voice conversion and synthesisβ22Nov 30, 2022Updated 3 years ago
- Rough implementation of Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments (Ethan β¦β25Dec 17, 2020Updated 5 years ago
- Music production for silent film clips.β32Apr 30, 2025Updated 10 months ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITSβ57Dec 1, 2023Updated 2 years ago
- Simple Python CLI script for downloading N-hours of audio from Youtube, based on a list of music genres.β33Dec 13, 2023Updated 2 years ago
- MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]β58Jan 6, 2026Updated last month
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"β36Feb 10, 2026Updated 3 weeks ago
- β64Jan 15, 2024Updated 2 years ago
- Sing an idea β‘οΈ AI music sampleπ₯πΆβ120Apr 21, 2024Updated last year