camenduru / audiocraftLinks
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆18Updated last year
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
Sorting:
- ☆111Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- This project aims to bring a more stable and user friendly check GPT interface designed to allow others to implement their own GPT prompt…☆12Updated last year
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆123Updated last year
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆23Updated last year
- Oobabooga extension for Bark TTS☆118Updated last year
- 1-click launcher for AUTOMATIC1111/stable-diffusion-webui with full SDXL 1.0 support.☆21Updated last year
- Diffusion_TTS extension for booga☆66Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated 10 months ago
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆162Updated last year
- A simple extension that uses Bark Text-to-Speech for audio output☆33Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆25Updated 2 years ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆100Updated 3 weeks ago
- ☆83Updated last year
- ☆22Updated last year
- A library for defining AI personalities for AI based models.We define a file format, assets and personalized scripts.☆54Updated 2 years ago
- ☆79Updated last year
- A 1-click launcher for https://github.com/comfyanonymous/ComfyUI☆35Updated last year
- Much simpler client for Stable Diffusion WebUI☆16Updated 6 months ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆54Updated last year
- ☆70Updated last month
- ☆142Updated last month
- ☆30Updated 4 months ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated 2 years ago
- An approach to creating the perfect prompt for any image generation task.☆29Updated 2 years ago
- A Qt GUI for large language models☆44Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆47Updated 5 months ago
- Uses ChatGPT, TTS, and Stable Diffusion to automatically generate videos☆29Updated 2 years ago