Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆32Jun 15, 2023Updated 2 years ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆123Mar 18, 2026Updated last month
- A discord bot that allows users to easily view the prompts of images that other users send☆13Oct 26, 2023Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 3 months ago
- sound stretch python module☆11May 1, 2019Updated 7 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆549Jul 25, 2023Updated 2 years ago
- ☆14Mar 12, 2023Updated 3 years ago
- ☆16Dec 19, 2023Updated 2 years ago
- Nodejs Plugin to resize GIFs☆12Mar 31, 2026Updated last month
- [CVPR 2024] Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text☆71Jun 17, 2024Updated last year
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆33Sep 10, 2025Updated 7 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 2 months ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- An extension of AUTOMATIC1111's webui to remove adverse noise from images.☆60Nov 6, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Remove adversarial noise from images☆85Apr 1, 2023Updated 3 years ago
- Generate images from an initial frame and text☆37Jul 28, 2023Updated 2 years ago
- Colorize your black and white images and YouTube videos for free. Streamlit application based on CNN deployed on Hugging Face.☆18Aug 31, 2024Updated last year
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- VectorTalker: SVG Talking Face Generation with Progressive Vectorisation☆14Dec 25, 2023Updated 2 years ago
- ☆10Apr 20, 2023Updated 3 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆81Apr 11, 2026Updated 3 weeks ago
- Simple GUI for Amphion Vevo☆14May 4, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ComfyUI custom node for generating prompts from images. Supports Qwen2.5 and Qwen3 (Instruct/Thinking) models, as well as the OpenAI API.☆24Jan 10, 2026Updated 3 months ago
- A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional…☆38Mar 17, 2026Updated last month
- Algorithmic composition of modern classical music in the twelve-tone technique.☆13May 10, 2025Updated 11 months ago
- [CVPR 2024] OHTA: One-shot Hand Avatar via Data-driven Implicit Priors☆47Jun 14, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆640Aug 15, 2024Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- ☆43Jan 14, 2024Updated 2 years ago
- Privacy Covers for Load image, preview image and Save image nodes in comfyUI☆28Dec 3, 2025Updated 5 months ago
- ☆62Oct 13, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- BH hackathon☆14Apr 4, 2024Updated 2 years ago
- Exploring Audio Possibilities of WebGPU☆15Jan 31, 2024Updated 2 years ago
- code used on the paper Face Reconstruction with Variational Autoencoder and Face Masks https://arxiv.org/abs/2112.02139☆12Jul 23, 2024Updated last year
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆54Jan 22, 2026Updated 3 months ago
- ☆49Jan 17, 2024Updated 2 years ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆19Apr 3, 2024Updated 2 years ago
- ☆10Jul 25, 2023Updated 2 years ago