camenduru / audiocraftLinks
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆19Updated last year
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
Sorting:
- 1-click launcher for AUTOMATIC1111/stable-diffusion-webui with full SDXL 1.0 support.☆22Updated 2 years ago
- ☆112Updated 2 years ago
- ☆24Updated 2 years ago
- ☆83Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆73Updated 2 years ago
- An extension to Oobabooga to add a simple memory function for chat☆25Updated 2 years ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆106Updated 3 weeks ago
- Oobabooga extension for Bark TTS☆119Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆73Updated 2 years ago
- ☆72Updated 5 months ago
- A library for defining AI personalities for AI based models.We define a file format, assets and personalized scripts.☆53Updated 2 years ago
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆124Updated 2 years ago
- Much simpler client for Stable Diffusion WebUI☆16Updated 10 months ago
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusion☆91Updated 2 years ago
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI☆71Updated 2 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆33Updated 2 years ago
- ☆18Updated 2 years ago
- ☆47Updated 2 years ago
- ☆54Updated 2 years ago
- Diffusion_TTS extension for booga☆68Updated 4 months ago
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆23Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆48Updated 9 months ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆58Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated last year
- An approach to creating the perfect prompt for any image generation task.☆28Updated 3 years ago
- A fine-tuned model based on Stable Diffusion to create images in the style of Midjourney☆80Updated last year
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆43Updated 9 months ago
- 🧩 / ● Open Interpreter - This plugin integrates Open Interpreter into LobeChat, allowing you to control your computer with a chat interf…☆25Updated 2 years ago
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…☆18Updated 2 years ago