suno-ai/bark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/suno-ai/bark)

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

☆39,196

Alternatives and similar repositories for bark

Users that are interested in bark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,750Aug 16, 2024Updated last year
facebookresearch / audiocraft
View on GitHub
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,471Mar 3, 2026Updated 4 months ago
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,860Nov 19, 2024Updated last year
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,936Apr 19, 2025Updated last year
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆104,926Apr 15, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Vision-CAIR / MiniGPT-4
View on GitHub
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,662Sep 2, 2024Updated last year
Significant-Gravitas / AutoGPT
View on GitHub
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o…
☆185,532Updated this week
oobabooga / textgen
View on GitHub
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,447Jun 2, 2026Updated last month
AIGC-Audio / AudioGPT
View on GitHub
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
☆10,169Jul 6, 2024Updated 2 years ago
lm-sys / FastChat
View on GitHub
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,491May 1, 2026Updated 2 months ago
nomic-ai / gpt4all
View on GitHub
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,395May 27, 2025Updated last year
AUTOMATIC1111 / stable-diffusion-webui
View on GitHub
Stable Diffusion web UI
☆164,244Mar 2, 2026Updated 4 months ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,259Jun 9, 2026Updated last month
ggml-org / whisper.cpp
View on GitHub
Port of OpenAI's Whisper model in C/C++
☆51,802Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
serp-ai / bark-with-voice-clone
View on GitHub
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
☆3,337Aug 24, 2025Updated 10 months ago
Comfy-Org / ComfyUI
View on GitHub
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☆120,715Updated this week
ggml-org / llama.cpp
View on GitHub
LLM inference in C/C++
☆120,346Updated this week
LAION-AI / Open-Assistant
View on GitHub
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,388Aug 17, 2024Updated last year
langchain-ai / langchain
View on GitHub
The agent engineering platform.
☆141,761Updated this week
Stability-AI / StableLM
View on GitHub
StableLM: Stability AI Language Models
☆15,686Apr 8, 2024Updated 2 years ago
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,951Jun 26, 2024Updated 2 years ago
Plachtaa / VALL-E-X
View on GitHub
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,937Feb 11, 2024Updated 2 years ago
CorentinJ / Real-Time-Voice-Cloning
View on GitHub
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,023Mar 9, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
XingangPan / DragGAN
View on GitHub
Official Code for DragGAN (SIGGRAPH 2023)
☆35,804May 18, 2024Updated 2 years ago
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,612Apr 10, 2026Updated 3 months ago
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆50,842Updated this week
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,276Nov 19, 2025Updated 7 months ago
openinterpreter / openinterpreter
View on GitHub
A lightweight coding agent, optimized for open models like GLM, Deepseek, and Kimi
☆64,946Updated this week
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,932Mar 25, 2026Updated 3 months ago
RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆59,774Updated this week
zylon-ai / private-gpt
View on GitHub
Complete API layer for private AI applications on local models: RAG, skills, tools, MCP, text-to-sql, and more. Works with any OpenAI-com…
☆57,331Updated this week
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,066Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
reworkd / AgentGPT
View on GitHub
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
☆36,274Apr 29, 2025Updated last year
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,312Aug 10, 2024Updated last year
facebookresearch / seamless_communication
View on GitHub
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,816Apr 8, 2026Updated 3 months ago
QuivrHQ / quivr
View on GitHub
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products wi…
☆39,206Jul 9, 2025Updated last year
ollama / ollama
View on GitHub
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆176,107Updated this week
hpcaitech / ColossalAI
View on GitHub
Making large AI models cheaper, faster and more accessible
☆41,412Updated this week
facefusion / facefusion
View on GitHub
Industry leading face manipulation platform
☆29,277Updated this week