suno-ai / barkLinks
🔊 Text-Prompted Generative Audio Model
☆38,264Updated 11 months ago
Alternatives and similar repositories for bark
Users that are interested in bark are comparing it to the libraries listed below
Sorting:
- Faster Whisper transcription with CTranslate2☆17,260Updated last month
- Port of OpenAI's Whisper model in C/C++☆41,811Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆17,031Updated 3 weeks ago
- A multi-voice TTS system trained with an emphasis on quality☆14,440Updated 8 months ago
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,184Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,617Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆85,658Updated last month
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,612Updated 8 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆41,599Updated 11 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,904Updated last year
- Code to accompany "A Method for Animating Children's Drawings of the Human Figure"☆12,563Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆22,322Updated 4 months ago
- LLM UI with advanced features, easy setup, and multiple backend support.☆44,443Updated this week
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,319Updated last year
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,343Updated 4 months ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆73,878Updated 2 months ago
- ☆8,552Updated last year
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆34,091Updated this week
- ☆7,838Updated last year
- Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powere…☆21,904Updated 3 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,020Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆83,539Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆33,420Updated 3 months ago
- The simplest way to run LLaMA on your local machine☆13,073Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,856Updated 11 months ago
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆56,332Updated 8 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆7,948Updated this week
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,987Updated 2 years ago
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆25,559Updated this week
- Let us control diffusion models!☆32,800Updated last year