facebookresearch / audiocraftLinks
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆22,515Updated 6 months ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
Sorting:
- 🔊 Text-Prompted Generative Audio Model☆38,561Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,191Updated last year
- Generate 3D objects conditioned on text or images☆12,077Updated last year
- Community interface for generative AI☆9,027Updated last year
- ☆7,837Updated last year
- Stable diffusion for real-time music generation☆3,802Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆25,964Updated this week
- A multi-voice TTS system trained with an emphasis on quality☆14,623Updated 10 months ago
- StableLM: Stability AI Language Models☆15,803Updated last year
- Let us control diffusion models!☆33,128Updated last year
- Official Code for DragGAN (SIGGRAPH 2023)☆35,985Updated last year
- Text-to-Audio/Music Generation☆2,502Updated last year
- WebUI extension for ControlNet☆17,814Updated last year
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,804Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,923Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,746Updated last year
- Code to accompany "A Method for Animating Children's Drawings of the Human Figure"☆12,692Updated last month
- The simplest way to run LLaMA on your local machine☆13,049Updated last year
- AudioLDM: Generate speech, sound effects, music and beyond, with text.☆2,744Updated 3 months ago
- The definitive Web UI for local AI, with powerful features and easy setup.☆45,125Updated 2 weeks ago
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch☆3,281Updated 2 years ago
- Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts☆4,621Updated last year
- Stable Diffusion with Core ML on Apple Silicon☆17,630Updated 3 months ago
- ImageBind One Embedding Space to Bind Them All☆8,800Updated this week
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,333Updated last month
- High-Resolution Image Synthesis with Latent Diffusion Models☆41,816Updated 3 months ago
- Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功…☆4,982Updated 2 years ago
- Stable Diffusion web UI☆157,046Updated 3 weeks ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆30,995Updated this week
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.☆8,731Updated last year