camenduru / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆17Updated last year
Alternatives and similar repositories for audiocraft:
Users that are interested in audiocraft are comparing it to the libraries listed below
- 1-click launcher for AUTOMATIC1111/stable-diffusion-webui with full SDXL 1.0 support.☆21Updated last year
- ☆111Updated last year
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆23Updated last year
- ☆48Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆45Updated 9 months ago
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI☆68Updated last year
- Docker image for the Text Generation Web UI: A Gradio web UI for Large Language Models. Supports Transformers, AWQ, GPTQ, llama.cpp (GGUF…☆1Updated 8 months ago
- BabyCommandAGI is designed to test what happens when you combine CLI and LLM, which are older computer interfaces than GUI. Based on Baby…☆46Updated last month
- ☆24Updated last year
- A tool that boosts chatgpt to its maximum potential☆38Updated last year
- A library for defining AI personalities for AI based models.We define a file format, assets and personalized scripts.☆54Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆66Updated last year
- ☆22Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆95Updated last week
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- Much simpler client for Stable Diffusion WebUI☆16Updated 2 months ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆24Updated last year
- ☆48Updated last year
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild☆21Updated last year
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆126Updated last year
- ☆12Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, you…☆59Updated 2 years ago
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆70Updated last year
- ☆18Updated last year
- ☆27Updated 5 months ago
- This project aims to bring a more stable and user friendly check GPT interface designed to allow others to implement their own GPT prompt…☆12Updated last year
- ☆79Updated last year
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year