facebookresearch / audiocraftLinks

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

☆22,385

Alternatives and similar repositories for audiocraft

Users that are interested in audiocraft are comparing it to the libraries listed below

Sorting:

suno-ai / bark
🔊 Text-Prompted Generative Audio Model
☆38,371Updated 11 months ago
Stability-AI / StableLM
StableLM: Stability AI Language Models
☆15,825Updated last year
Plachtaa / VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,910Updated last year
riffusion / riffusion-hobby
Stable diffusion for real-time music generation
☆3,773Updated last year
deep-floyd / IF
☆7,843Updated last year
oobabooga / text-generation-webui
LLM UI with advanced features, easy setup, and multiple backend support.
☆44,681Updated this week
LAION-AI / Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,440Updated 11 months ago
facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,627Updated 9 months ago
lucidrains / musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
☆3,275Updated last year
AIGC-Audio / AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
☆10,190Updated last year
Stability-AI / StableStudio
Community interface for generative AI
☆9,023Updated last year
Vision-CAIR / MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,745Updated 11 months ago
guoyww / AnimateDiff
Official implementation of AnimateDiff.
☆11,663Updated last year
haoheliu / AudioLDM2
Text-to-Audio/Music Generation
☆2,482Updated 10 months ago
CompVis / stable-diffusion
A latent text-to-image diffusion model
☆71,331Updated last year
haoheliu / AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
☆2,723Updated last month
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆38,983Updated 2 months ago
OpenTalker / SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,088Updated last year
facebookresearch / AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
☆12,655Updated 3 months ago
Stability-AI / stable-audio-tools
Generative models for conditional audio generation
☆3,396Updated last month
nlpxucan / WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,444Updated 2 months ago
meta-llama / codellama
Inference code for CodeLlama models
☆16,354Updated last year
huggingface / diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
☆30,271Updated this week
invoke-ai / InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…
☆25,702Updated this week
openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆86,321Updated last month
Stability-AI / generative-models
Generative Models by Stability AI
☆26,269Updated 2 months ago
facebookresearch / encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
☆3,759Updated last year
neonbjb / tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
☆14,494Updated 8 months ago
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,099Updated this week
meta-llama / llama
Inference code for Llama models
☆58,621Updated 6 months ago