Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆32Jun 15, 2023Updated 2 years ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆124Mar 18, 2026Updated 2 months ago
- A discord bot that allows users to easily view the prompts of images that other users send☆13Oct 26, 2023Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated 2 years ago
- ☆551Jul 25, 2023Updated 2 years ago
- ☆14Mar 12, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2024] Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text☆71Jun 17, 2024Updated last year
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆33Sep 10, 2025Updated 8 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 2 months ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- An extension of AUTOMATIC1111's webui to remove adverse noise from images.☆59Nov 6, 2023Updated 2 years ago
- Remove adversarial noise from images☆84Apr 1, 2023Updated 3 years ago
- Generate images from an initial frame and text☆37Jul 28, 2023Updated 2 years ago
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- VectorTalker: SVG Talking Face Generation with Progressive Vectorisation☆14Dec 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Apr 20, 2023Updated 3 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- Simple GUI for Amphion Vevo☆14May 4, 2025Updated last year
- ComfyUI custom node for generating prompts from images. Supports Qwen2.5 and Qwen3 (Instruct/Thinking) models, as well as the OpenAI API.☆24Jan 10, 2026Updated 4 months ago
- simple trainer for musicgen/audiocraft☆31Jul 12, 2024Updated last year
- [CVPR 2024] OHTA: One-shot Hand Avatar via Data-driven Implicit Priors☆47Jun 14, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆640Aug 15, 2024Updated last year
- LLM Applications built using Streamlit, LangChain, and OpenAI API☆11Oct 7, 2023Updated 2 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆43Jan 14, 2024Updated 2 years ago
- Privacy Covers for Load image, preview image and Save image nodes in comfyUI☆29Dec 3, 2025Updated 5 months ago
- ☆62Oct 13, 2023Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated 2 years ago
- code used on the paper Face Reconstruction with Variational Autoencoder and Face Masks https://arxiv.org/abs/2112.02139☆12Jul 23, 2024Updated last year
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆87Apr 11, 2026Updated last month
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆56Jan 22, 2026Updated 4 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆19Apr 3, 2024Updated 2 years ago
- ☆49Jan 17, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jul 25, 2023Updated 2 years ago
- ☆12Feb 6, 2024Updated 2 years ago
- Easy-to-use AI tools.☆36Nov 5, 2024Updated last year
- Data Science Resources☆12Oct 2, 2024Updated last year
- Simple node to capture images from your webcam☆15Apr 16, 2025Updated last year
- GUESS: GradUally Enriching SyntheSis for Text-Driven Human Motion Generation ( IEEE Transactions on Visualization and Computer Graphics, …☆33Jan 29, 2024Updated 2 years ago
- A script for scheduling CFG scale and ETA to change during the denoising steps☆42Mar 9, 2023Updated 3 years ago