facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
★22,473 · Updated 6 months ago
Alternatives and similar repositories for audiocraft
Users who are interested in audiocraft are comparing it to the repositories listed below.
- 🔊 Text-Prompted Generative Audio Model ★38,502 · Updated last year
- StableLM: Stability AI Language Models ★15,806 · Updated last year
- Stable diffusion for real-time music generation ★3,788 · Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head ★10,193 · Updated last year
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch ★3,275 · Updated 2 years ago
- Generative Models by Stability AI ★26,371 · Updated 3 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models ★41,740 · Updated 2 months ago
- ★7,842 · Updated last year
- AudioLDM: Generate speech, sound effects, music and beyond, with text. ★2,736 · Updated 2 months ago
- Community interface for generative AI ★9,039 · Updated last year
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ★9,458 · Updated 3 months ago
- Universal LLM Deployment Engine with ML Compilation ★21,346 · Updated this week
- Official implementation of AnimateDiff. ★11,758 · Updated last year
- High-performance In-browser LLM Inference Engine ★16,427 · Updated 4 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation ★11,660 · Updated 10 months ago
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio. ★3,782 · Updated last year
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ★37,466 · Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. ★30,742 · Updated this week
- Text-to-Audio/Music Generation ★2,492 · Updated 11 months ago
- Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts ★4,626 · Updated 11 months ago
- so-vits-svc fork with realtime support, improved interface and more features. ★9,118 · Updated this week
- An unofficial PyTorch implementation of the audio LM VALL-E ★2,989 · Updated 2 years ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★4,633 · Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ★5,957 · Updated last year
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. ★34,911 · Updated 4 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ ★7,917 · Updated last year
- Generative models for conditional audio generation ★3,431 · Updated 2 months ago
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf ★24,347 · Updated last month
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ★25,742 · Updated last year
- ImageBind: One Embedding Space to Bind Them All ★8,791 · Updated last week