Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆123Mar 18, 2026Updated last month
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆32Jun 15, 2023Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆640Aug 15, 2024Updated last year
- simple trainer for musicgen/audiocraft☆31Jul 12, 2024Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆50Nov 11, 2025Updated 5 months ago
- ☆549Jul 25, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- fine-tuning MusicGen without prompts to generate music with a specific style☆66Jul 11, 2023Updated 2 years ago
- Extension for AUTOMATIC1111 which can generate infinite loop videos in minutes.☆49Mar 26, 2023Updated 3 years ago
- This is a cog implementation of the fine-tuner for Meta's MusicGen☆55Apr 5, 2024Updated 2 years ago
- ☆18Jan 16, 2024Updated 2 years ago
- ☆13Oct 14, 2024Updated last year
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆14Apr 2, 2025Updated last year
- A discord bot that allows users to easily view the prompts of images that other users send☆13Oct 26, 2023Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 3 months ago
- ☆11Sep 28, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆20Mar 27, 2024Updated 2 years ago
- ☆18May 4, 2025Updated last year
- Riffusion extension for AUTOMATIC1111's SD Web UI☆203Jun 5, 2023Updated 2 years ago
- A notebook running TensorRT's StableDiffusion demo on Google Colaboratory☆18Feb 1, 2023Updated 3 years ago
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…☆55Jan 18, 2024Updated 2 years ago
- ☆18Jan 20, 2025Updated last year
- Mustango: Toward Controllable Text-to-Music Generation☆388Jun 2, 2025Updated 11 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- Official code for SongEcho☆59Mar 3, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Sep 13, 2022Updated 3 years ago
- Fast Gabor spectral transforms in Java. Using a JNI bridge with the gaborator C++ library.☆14Jan 20, 2023Updated 3 years ago
- GUI for the new musubi-tuner☆55Jan 25, 2025Updated last year
- ☆19Jan 17, 2025Updated last year
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 3 years ago
- ☆10Nov 6, 2017Updated 8 years ago
- ☆14Mar 12, 2023Updated 3 years ago
- This repository demonstrates browser based implementation of DepthAnything and DepthAnythingV2 models. It is powered by Onnx and does not…☆36Mar 14, 2025Updated last year
- Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024]☆19Dec 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆123Jun 12, 2023Updated 2 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 7 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆34Jul 17, 2024Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- Easy to install image and video colorization using onnx converted deoldify model☆19Sep 23, 2024Updated last year
- Frontend (and soon also midleware and backend) for a new, opensource image generation platform.☆14Nov 5, 2022Updated 3 years ago
- A Jupyter widgets-based interactive notebook for Google Colab to generate images using Stable Diffusion.☆19Dec 13, 2023Updated 2 years ago