EndlessReform / bark-with-voice-clone
π Text-prompted Generative Audio Model - With the ability to clone voices
β20Updated last year
Alternatives and similar repositories for bark-with-voice-clone:
Users that are interested in bark-with-voice-clone are comparing it to the libraries listed below
- Faster Tortoise inference then Tortoise Fast Forkβ128Updated 9 months ago
- TorToiSe fine-tuning with DLASβ218Updated 6 months ago
- β38Updated 9 months ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ150Updated 7 months ago
- GradioUI for TortoiseTTS voice generationβ34Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β33Updated last year
- Create training data for training a voice cloner for bark text to speech.β43Updated last year
- β80Updated 7 months ago
- Your one-stop solution for voice dataset creationβ117Updated last year
- Audio datasets, easier.β82Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β118Updated 11 months ago
- Oobabooga extension for Bark TTSβ118Updated last year
- Misc. tools/scripts that I made to use for tortoiseβ22Updated 5 months ago
- β147Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific styleβ61Updated last year
- Examples of apps built with Nendo, the AI Audio Tool Suiteβ56Updated 11 months ago
- A simple extension that uses Bark Text-to-Speech for audio outputβ35Updated last year
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tierβ24Updated 2 months ago
- β62Updated 6 months ago
- β14Updated 7 months ago
- Fine-tune your own MusicGen with LoRAβ123Updated 9 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β52Updated 9 months ago
- Testbed for the fastest SD pipelinesβ35Updated last year
- MusicGen conditioned with chord progression.β11Updated last year
- β16Updated last year
- β27Updated last year
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.β10Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.β67Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β65Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioβ67Updated last year