facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆22,227 Updated 3 months ago
Alternatives and similar repositories for audiocraft
Users who are interested in audiocraft are comparing it to the libraries listed below.
- 🔊 Text-Prompted Generative Audio Model ☆38,114 Updated 10 months ago
- Let us control diffusion models! ☆32,665 Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ☆25,694 Updated 10 months ago
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf ☆24,215 Updated 9 months ago
- Generative Models by Stability AI ☆26,093 Updated last month
- StableLM: Stability AI Language Models ☆15,828 Updated last year
- ☆7,824 Updated last year
- ImageBind One Embedding Space to Bind Them All ☆8,712 Updated 11 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆38,804 Updated last month
- LLM UI with advanced features, easy setup, and multiple backend support. ☆44,184 Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX. ☆29,627 Updated this week
- A latent text-to-image diffusion model ☆71,062 Updated last year
- ☆21,610 Updated 7 months ago
- Inference code for Llama models ☆58,445 Updated 5 months ago
- Official implementation of AnimateDiff. ☆11,536 Updated 11 months ago
- ☆34,432 Updated last year
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆12,930 Updated last year
- Generate 3D objects conditioned on text or images ☆11,959 Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models ☆41,262 Updated last week
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch ☆3,264 Updated last year
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. ☆34,448 Updated 2 months ago
- Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts ☆4,602 Updated 9 months ago
- WebUI extension for ControlNet ☆17,699 Updated 10 months ago
- Universal LLM Deployment Engine with ML Compilation ☆20,880 Updated last week
- Code to accompany "A Method for Animating Children's Drawings of the Human Figure" ☆12,548 Updated 2 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,579 Updated 7 months ago
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat… ☆25,404 Updated this week
- Using Low-rank adaptation to quickly fine-tune diffusion models. ☆7,373 Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,918 Updated 10 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,397 Updated 10 months ago