π BARK INFINITY GUI CMD πΆ Powered Up Bark Text-prompted Generative Audio Model
β1,011Oct 21, 2023Updated 2 years ago
Alternatives and similar repositories for bark
Users that are interested in bark are comparing it to the libraries listed below
Sorting:
- π Text-prompted Generative Audio Modelβ237Apr 27, 2023Updated 2 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ3,343Aug 24, 2025Updated 6 months ago
- Site for sharing Bark voicesβ51Mar 25, 2025Updated 11 months ago
- π Text-Prompted Generative Audio Model with Gradioβ691Nov 23, 2023Updated 2 years ago
- π Text-Prompted Generative Audio Modelβ39,006Aug 19, 2024Updated last year
- The code for the bark-voicecloning model. Training and inference.β710Sep 13, 2023Updated 2 years ago
- One click installer scripts for Bark Infinityβ25Dec 23, 2023Updated 2 years ago
- Fast TorToiSe inference (5x or your money back!)β830Jul 10, 2024Updated last year
- A webui for different audio related Neural Networksβ1,236May 19, 2025Updated 9 months ago
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,β¦β2,989Feb 19, 2026Updated last week
- Oobabooga extension for Bark TTSβ120Nov 23, 2023Updated 2 years ago
- A multi-voice TTS system trained with an emphasis on qualityβ14,813Nov 19, 2024Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,172Aug 10, 2024Updated last year
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ20May 17, 2023Updated 2 years ago
- TorToiSe fine-tuning with DLASβ226Aug 1, 2024Updated last year
- The definitive Web UI for local AI, with powerful features and easy setup.β46,091Feb 3, 2026Updated 3 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,013Mar 13, 2025Updated 11 months ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ44,608Aug 16, 2024Updated last year
- PALLAIDIUM β a generative AI movie studio, seamlessly integrated into the Blender Video Editor (VSE), enabling end-to-end production fromβ¦β1,344Updated this week
- Foundational model for human-like, expressive TTSβ4,199Jul 30, 2024Updated last year
- [CVPR 2023] SadTalkerοΌLearning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animationβ13,598Jun 26, 2024Updated last year
- β7,843Apr 14, 2024Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,682Apr 3, 2024Updated last year
- π Text-Prompted Generative Audio Modelβ92Apr 22, 2023Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ130May 31, 2023Updated 2 years ago
- An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extensionβ1,964Mar 13, 2024Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitlesβ80Apr 26, 2023Updated 2 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generationβ857Nov 16, 2024Updated last year
- π BARK INFINITY πΆ Power Up The Bark Text-prompted Generative Audio Modelβ19May 1, 2023Updated 2 years ago
- Text-to-Audio/Music Generationβ2,580Sep 29, 2024Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,955Feb 11, 2024Updated 2 years ago
- For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcppβ3,857Jan 16, 2025Updated last year
- An unofficial PyTorch implementation of the audio LM VALL-Eβ2,993May 10, 2023Updated 2 years ago
- A family of diffusion models for text-to-audio generation.β1,231Jul 29, 2025Updated 7 months ago
- A8R8 (Alternate Reality), an opinionated interface for Stable Diffusion image generation, and more.β122Oct 19, 2025Updated 4 months ago
- Mustango: Toward Controllable Text-to-Music Generationβ386Jun 2, 2025Updated 9 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsβ16Jul 19, 2023Updated 2 years ago
- Audio datasets, easier.β86Aug 19, 2023Updated 2 years ago
- Official Implementation of StyleTTS-VCβ197Jan 14, 2025Updated last year