facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆21,943 Updated last month
Alternatives and similar repositories for audiocraft:
Users who are interested in audiocraft are comparing it to the libraries listed below:
- Let us control diffusion models! ☆32,245 Updated last year
- ☆7,804 Updated last year
- Community interface for generative AI ☆8,975 Updated last year
- 🔊 Text-Prompted Generative Audio Model ☆37,722 Updated 8 months ago
- WebUI extension for ControlNet ☆17,586 Updated 8 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends. ☆43,492 Updated this week
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head ☆10,143 Updated 10 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆38,535 Updated 3 weeks ago
- StableLM: Stability AI Language Models ☆15,832 Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ☆25,662 Updated 8 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,344 Updated 8 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,509 Updated 5 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX. ☆28,897 Updated this week
- Generate 3D objects conditioned on text or images ☆11,890 Updated 10 months ago
- ImageBind: One Embedding Space to Bind Them All ☆8,632 Updated 9 months ago
- fast-stable-diffusion + DreamBooth ☆7,727 Updated last month
- Stable diffusion for real-time music generation ☆3,673 Updated 9 months ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆73,274 Updated last month
- High-Resolution Image Synthesis with Latent Diffusion Models ☆40,917 Updated 7 months ago
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch ☆3,248 Updated last year
- Official implementation of AnimateDiff. ☆11,365 Updated 9 months ago
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat… ☆25,036 Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆39,821 Updated 8 months ago
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆12,667 Updated 10 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models. ☆7,328 Updated last year
- Inference code for Llama models ☆58,197 Updated 3 months ago
- ☆10,607 Updated last week
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion. ☆8,611 Updated last year
- Port of OpenAI's Whisper model in C/C++ ☆39,829 Updated this week
- Home of StarCoder: fine-tuning & inference! ☆7,415 Updated last year