facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
★22,846 · Updated 9 months ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
- 🔊 Text-Prompted Generative Audio Model · ★38,858 · Updated last year
- Universal LLM Deployment Engine with ML Compilation · ★21,808 · Updated last week
- ★7,845 · Updated last year
- StableLM: Stability AI Language Models · ★15,783 · Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head · ★10,209 · Updated last year
- Stable diffusion for real-time music generation · ★3,856 · Updated last year
- ★22,057 · Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) · ★25,772 · Updated last year
- The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. · ★21,230 · Updated last year
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch · ★3,287 · Updated 2 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. · ★30,268 · Updated last year
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… · ★37,491 · Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. · ★39,332 · Updated 7 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision · ★92,483 · Updated 2 weeks ago
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf · ★24,500 · Updated 5 months ago
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. · ★35,428 · Updated 8 months ago
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio. · ★3,867 · Updated last year
- LlamaIndex is the leading framework for building LLM-powered agents over your data. · ★46,055 · Updated last week
- Inference code for Llama models · ★59,014 · Updated 11 months ago
- The definitive Web UI for local AI, with powerful features and easy setup. · ★45,744 · Updated this week
- Muzic: Music Understanding and Generation with Artificial Intelligence · ★4,887 · Updated last year
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. · ★12,018 · Updated last week
- AudioLDM: Generate speech, sound effects, music and beyond, with text. · ★2,793 · Updated 6 months ago
- Generate 3D objects conditioned on text or images · ★12,181 · Updated last year
- Inference Llama 2 in one file of pure C · ★19,063 · Updated last year
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o… · ★180,542 · Updated this week
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion. · ★8,789 · Updated 2 years ago
- Text-to-Audio/Music Generation · ★2,551 · Updated last year
- Port of OpenAI's Whisper model in C/C++ · ★45,353 · Updated last week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. · ★76,990 · Updated 7 months ago