facebookresearch / audiocraftLinks
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
β22,042Updated 2 months ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
Sorting:
- π Text-Prompted Generative Audio Modelβ37,909Updated 9 months ago
- StableLM: Stability AI Language Modelsβ15,832Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Headβ10,144Updated 10 months ago
- Universal LLM Deployment Engine with ML Compilationβ20,685Updated 3 weeks ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β38,670Updated last week
- A Gradio web UI for Large Language Models with support for multiple inference backends.β43,761Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,867Updated last year
- Stable diffusion for real-time music generationβ3,694Updated 10 months ago
- Community interface for generative AIβ8,983Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creatβ¦β25,202Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β15,944Updated 3 weeks ago
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdfβ24,156Updated 8 months ago
- β7,814Updated last year
- Inference code for Llama modelsβ58,290Updated 4 months ago
- Instruct-tune LLaMA on consumer hardwareβ18,904Updated 10 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,411Updated 9 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β22,643Updated 9 months ago
- Code to accompany "A Method for Animating Children's Drawings of the Human Figure"β12,476Updated last month
- π¦π Build context-aware reasoning applicationsβ108,279Updated this week
- ImageBind One Embedding Space to Bind Them Allβ8,663Updated 10 months ago
- Running large language models on a single GPU for throughput-oriented scenarios.β9,320Updated 7 months ago
- High-performance In-browser LLM Inference Engineβ15,547Updated 3 weeks ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,019Updated 10 months ago
- Port of OpenAI's Whisper model in C/C++β40,358Updated this week
- π A list of open LLMs available for commercial use.β12,052Updated 3 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,362Updated 9 months ago
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorchβ3,254Updated last year
- Locally run an Instruction-Tuned Chat-Style LLMβ10,229Updated 2 years ago
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus oβ¦β175,738Updated this week
- Official implementation of AnimateDiff.β11,434Updated 10 months ago