facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆21,798Updated last month
Alternatives and similar repositories for audiocraft:
Users that are interested in audiocraft are comparing it to the libraries listed below
- 🔊 Text-Prompted Generative Audio Model☆37,431Updated 7 months ago
- ☆7,789Updated last year
- StableLM: Stability AI Language Models☆15,828Updated last year
- Universal LLM Deployment Engine with ML Compilation☆20,396Updated last week
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,130Updated 9 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,413Updated this week
- Let us control diffusion models!☆31,985Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,653Updated 7 months ago
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.☆33,772Updated 3 weeks ago
- Instruct-tune LLaMA on consumer hardware☆18,889Updated 8 months ago
- Community interface for generative AI☆8,968Updated 11 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,986Updated this week
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf☆24,088Updated 6 months ago
- ☆21,358Updated 5 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,362Updated 8 months ago
- Making large AI models cheaper, faster and more accessible☆40,766Updated last week
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆24,863Updated this week
- Generate 3D objects conditioned on text or images☆11,869Updated 9 months ago
- ☆34,496Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆29,932Updated 9 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆22,237Updated 8 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆39,356Updated 8 months ago
- Port of OpenAI's Whisper model in C/C++☆39,198Updated last week
- WebUI extension for ControlNet☆17,530Updated 8 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆40,782Updated 6 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆43,211Updated this week
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch☆3,242Updated last year
- Stable diffusion for real-time music generation☆3,639Updated 8 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,596Updated 9 months ago
- Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.☆20,450Updated last month