Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆32Jun 15, 2023Updated 3 years ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆124Mar 18, 2026Updated 3 months ago
- A discord bot that allows users to easily view the prompts of images that other users send☆13Oct 26, 2023Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 4 months ago
- sound stretch python module☆11May 1, 2019Updated 7 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆551Jul 25, 2023Updated 2 years ago
- ☆14Mar 12, 2023Updated 3 years ago
- My self-trained AI models for photorealistic image upscaling and restoration. Built on ESRGAN, SAFMN, RealPLKSR and other architectures a…☆47Jun 6, 2026Updated last week
- Nodejs Plugin to resize GIFs☆12Mar 31, 2026Updated 2 months ago
- [CVPR 2024] Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text☆71Jun 17, 2024Updated 2 years ago
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆33Sep 10, 2025Updated 9 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 3 months ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- An extension of AUTOMATIC1111's webui to remove adverse noise from images.☆59Nov 6, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Generate images from an initial frame and text☆37Jul 28, 2023Updated 2 years ago
- Colorize your black and white images and YouTube videos for free. Streamlit application based on CNN deployed on Hugging Face.☆16Aug 31, 2024Updated last year
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- ☆10Apr 20, 2023Updated 3 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- ComfyUI custom node for generating prompts from images. Supports Qwen2.5 and Qwen3 (Instruct/Thinking) models, as well as the OpenAI API.☆25Jan 10, 2026Updated 5 months ago
- Simple GUI for Amphion Vevo☆14May 4, 2025Updated last year
- simple trainer for musicgen/audiocraft☆31Jul 12, 2024Updated last year
- Algorithmic composition of modern classical music in the twelve-tone technique.☆13May 10, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2024] OHTA: One-shot Hand Avatar via Data-driven Implicit Priors☆47Jun 14, 2024Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆641Aug 15, 2024Updated last year
- LLM Applications built using Streamlit, LangChain, and OpenAI API☆11Oct 7, 2023Updated 2 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- ☆43Jan 14, 2024Updated 2 years ago
- Privacy Covers for Load image, preview image and Save image nodes in comfyUI☆29Dec 3, 2025Updated 6 months ago
- ☆62Oct 13, 2023Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated 2 years ago
- Exploring Audio Possibilities of WebGPU☆15Jan 31, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- code used on the paper Face Reconstruction with Variational Autoencoder and Face Masks https://arxiv.org/abs/2112.02139☆12Jul 23, 2024Updated last year
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆56Jan 22, 2026Updated 4 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆19Apr 3, 2024Updated 2 years ago
- ☆49Jan 17, 2024Updated 2 years ago
- ☆10Jul 25, 2023Updated 2 years ago
- ☆12Feb 6, 2024Updated 2 years ago
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆92Apr 11, 2026Updated 2 months ago