suno-ai / bark
🔊 Text-Prompted Generative Audio Model
☆38,961 Updated last year
Alternatives and similar repositories for bark
Users who are interested in bark compare it to the libraries listed below.
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆77,078 Updated 8 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,446 Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆94,315 Updated last month
- Faster Whisper transcription with CTranslate2 ☆20,833 Updated 2 months ago
- A multi-voice TTS system trained with an emphasis on quality ☆14,802 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆22,971 Updated 10 months ago
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ☆25,762 Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ ☆7,959 Updated 2 years ago
- Let us control diffusion models! ☆33,621 Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆3,341 Updated 5 months ago
- StableLM: Stability AI Language Models ☆15,766 Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆102,600 Updated this week
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆13,587 Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head ☆10,209 Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,392 Updated 8 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,680 Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,265 Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat… ☆26,651 Updated last week
- The definitive Web UI for local AI, with powerful features and easy setup. ☆46,006 Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆6,162 Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆35,918 Updated 9 months ago
- ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. ☆9,514 Updated 2 weeks ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆20,051 Updated this week
- Port of OpenAI's Whisper model in C/C++ ☆46,518 Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,456 Updated last year
- The simplest way to run LLaMA on your local machine ☆12,993 Updated last year
- ☆7,846 Updated last year
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,186 Updated 2 years ago
- Code to accompany "A Method for Animating Children's Drawings of the Human Figure" ☆12,758 Updated 5 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆7,815 Updated 2 years ago