facebookresearch / audiocraftView external linksLinks
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
β22,993Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
Sorting:
- π Text-Prompted Generative Audio Modelβ38,970Aug 19, 2024Updated last year
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ44,516Aug 16, 2024Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,745Nov 14, 2024Updated last year
- Text-to-Audio/Music Generationβ2,578Sep 29, 2024Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Headβ10,209Jul 6, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β35,918Apr 19, 2025Updated 9 months ago
- Generative models for conditional audio generationβ3,599Jan 22, 2026Updated 3 weeks ago
- Generative Models by Stability AIβ26,898Dec 16, 2025Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervisionβ94,628Dec 15, 2025Updated 2 months ago
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.β3,883Jan 4, 2024Updated 2 years ago
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,687May 27, 2025Updated 8 months ago
- The definitive Web UI for local AI, with powerful features and easy setup.β46,037Feb 3, 2026Updated last week
- Official Code for DragGAN (SIGGRAPH 2023)β35,972May 18, 2024Updated last year
- AudioLDM: Generate speech, sound effects, music and beyond, with text.β2,825Jun 25, 2025Updated 7 months ago
- Official implementation of AnimateDiff.β12,018Jul 31, 2024Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β103,139Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,402Jun 2, 2025Updated 8 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorchβ2,619Jan 12, 2025Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,446Aug 12, 2024Updated last year
- Let us control diffusion models!β33,640Feb 25, 2024Updated last year
- LLM inference in C/C++β94,823Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,956Feb 11, 2024Updated 2 years ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,452Aug 17, 2024Updated last year
- Muzic: Music Understanding and Generation with Artificial Intelligenceβ4,902Oct 12, 2024Updated last year
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.β77,136May 27, 2025Updated 8 months ago
- Stable Diffusion web UIβ160,514Dec 18, 2025Updated last month
- Inference code for Llama modelsβ59,141Jan 26, 2025Updated last year
- StableLM: Stability AI Language Modelsβ15,766Apr 8, 2024Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creatβ¦β26,682Feb 7, 2026Updated last week
- Universal LLM Deployment Engine with ML Compilationβ22,039Updated this week
- π¦π The platform for reliable agents.β126,317Updated this week
- CLI platform to experiment with codegen. Precursor to: https://lovable.devβ55,213May 14, 2025Updated 9 months ago
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β32,768Updated this week
- A natural language interface for computersβ62,135Updated this week
- Focus on prompting and generatingβ47,688Dec 1, 2025Updated 2 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.β46,977Updated this week
- Industry leading face manipulation platformβ26,787Updated this week
- Stable diffusion for real-time music generationβ3,868Jul 22, 2024Updated last year
- one-click face swapβ30,509Aug 19, 2024Updated last year