rkfg / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆32 · Jun 15, 2023 · Updated 2 years ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆123 · Jun 20, 2025 · Updated 7 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆16 · Apr 18, 2024 · Updated last year
- ☆12 · Sep 26, 2023 · Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script ☆10 · Jan 20, 2026 · Updated 3 weeks ago
- BH hackathon ☆14 · Apr 4, 2024 · Updated last year
- [CVPR 2024] Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text ☆71 · Jun 17, 2024 · Updated last year
- UnrealBakedSDF is a sample Unreal project for importing and visualizing BakedSDF meshes. ☆15 · Jun 14, 2023 · Updated 2 years ago
- ☆12 · Feb 6, 2024 · Updated 2 years ago
- ☆549 · Jul 25, 2023 · Updated 2 years ago
- sound stretch python module ☆11 · May 1, 2019 · Updated 6 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull ☆13 · Oct 9, 2023 · Updated 2 years ago
- Generate images from an initial frame and text ☆37 · Jul 28, 2023 · Updated 2 years ago
- A discord bot that allows users to easily view the prompts of images that other users send ☆13 · Oct 26, 2023 · Updated 2 years ago
- ☆15 · Jan 8, 2024 · Updated 2 years ago
- VectorTalker: SVG Talking Face Generation with Progressive Vectorisation ☆15 · Dec 25, 2023 · Updated 2 years ago
- MMD viewer powered by Babylon.js and babylon-mmd ☆16 · Aug 2, 2025 · Updated 6 months ago
- ☆14 · Oct 16, 2023 · Updated 2 years ago
- animatediff prompt travel ☆19 · Jan 27, 2024 · Updated 2 years ago
- GUESS: GradUally Enriching SyntheSis for Text-Driven Human Motion Generation (IEEE Transactions on Visualization and Computer Graphics, … ☆33 · Jan 29, 2024 · Updated 2 years ago
- ☆17 · Jan 10, 2024 · Updated 2 years ago
- MJCF Importer Extension ☆18 · Jul 24, 2025 · Updated 6 months ago
- ☆19 · Jul 31, 2024 · Updated last year
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer ☆45 · Mar 14, 2024 · Updated last year
- ☆16 · Apr 7, 2024 · Updated last year
- MCP server + embedded terminal that lets Claude Code see and edit your ComfyUI workflows ☆38 · Jan 31, 2026 · Updated 2 weeks ago
- ControlNet control image preprocess library ☆15 · Feb 27, 2023 · Updated 2 years ago
- ☆17 · Dec 28, 2023 · Updated 2 years ago
- ☆16 · Apr 23, 2024 · Updated last year
- [CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation ☆45 · May 7, 2024 · Updated last year
- ☆43 · Jan 14, 2024 · Updated 2 years ago
- Import and apply LUTs to images! ☆17 · Mar 28, 2022 · Updated 3 years ago
- Quick lookup for Instant-angelo (https://github.com/hugoycj/Instant-angelo) results ☆21 · Oct 22, 2023 · Updated 2 years ago
- ☆16 · Dec 19, 2023 · Updated 2 years ago
- Thumbnail maker / image editor using PHP ☆19 · Aug 11, 2021 · Updated 4 years ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance ☆18 · Apr 3, 2024 · Updated last year
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait ☆28 · Sep 10, 2025 · Updated 5 months ago
- [NeurIPS 2023] Official Code for "Towards Robust and Expressive Whole-body Human Pose and Shape Estimation" ☆50 · Updated this week
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields" ☆18 · Feb 3, 2025 · Updated last year
- ☆24 · Sep 5, 2025 · Updated 5 months ago