suno-ai / barkLinks
🔊 Text-Prompted Generative Audio Model
☆38,371Updated last year
Alternatives and similar repositories for bark
Users that are interested in bark are comparing it to the libraries listed below
Sorting:
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆42,107Updated last year
- A multi-voice TTS system trained with an emphasis on quality☆14,511Updated 9 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,983Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆22,385Updated 5 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,108Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,910Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,632Updated 9 months ago
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,745Updated 11 months ago
- StableLM: Stability AI Language Models☆15,821Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,322Updated last year
- one-click face swap☆30,129Updated last year
- ☆7,840Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,111Updated last year
- Let us control diffusion models!☆32,886Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆85,436Updated this week
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,191Updated last year
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o…☆177,761Updated this week
- Making large AI models cheaper, faster and more accessible☆41,081Updated this week
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.☆34,762Updated 3 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆34,022Updated 4 months ago
- Community interface for generative AI☆9,025Updated last year
- Official Code for DragGAN (SIGGRAPH 2023)☆35,915Updated last year
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆12,164Updated this week
- High-Resolution Image Synthesis with Latent Diffusion Models☆41,526Updated last month
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆76,080Updated 2 months ago
- A latent text-to-image diffusion model☆71,331Updated last year
- WebUI extension for ControlNet☆17,763Updated last year
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,984Updated 2 years ago
- SOTA Open Source TTS☆22,694Updated 3 weeks ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,120Updated last year