facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
★22,430 · Updated 5 months ago
Alternatives and similar repositories for audiocraft
Users who are interested in audiocraft compare it to the libraries listed below.
- 🔊 Text-Prompted Generative Audio Model ★38,441 · Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation ★11,654 · Updated 9 months ago
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head ★10,191 · Updated last year
- StableLM: Stability AI Language Models ★15,811 · Updated last year
- AudioLDM: Generate speech, sound effects, music and beyond, with text. ★2,730 · Updated 2 months ago
- Stable diffusion for real-time music generation ★3,778 · Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ ★7,913 · Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ★39,049 · Updated 3 months ago
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch ★3,275 · Updated last year
- Official implementation of AnimateDiff. ★11,715 · Updated last year
- Universal LLM Deployment Engine with ML Compilation ★21,259 · Updated this week
- LLM UI with advanced features, easy setup, and multiple backend support. ★44,845 · Updated this week
- Generate 3D objects conditioned on text or images ★12,052 · Updated last year
- ★7,847 · Updated last year
- Text-to-Audio/Music Generation ★2,484 · Updated 11 months ago
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio. ★3,771 · Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ★42,340 · Updated last year
- Generative models for conditional audio generation ★3,411 · Updated last month
- Inference code for Llama models ★58,685 · Updated 7 months ago
- WebUI extension for ControlNet ★17,785 · Updated last year
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ★13,178 · Updated last year
- Let us control diffusion models! ★32,986 · Updated last year
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ★9,457 · Updated 2 months ago
- Stable diffusion for real-time music generation (web app) ★2,668 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ★10,648 · Updated last year
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ★44,018 · Updated this week
- ImageBind One Embedding Space to Bind Them All ★8,779 · Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models ★4,943 · Updated last year
- Community interface for generative AI ★9,042 · Updated last year
- An unofficial PyTorch implementation of the audio LM VALL-E ★2,986 · Updated 2 years ago