facebookresearch / audiocraftLinks
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆22,385Updated 5 months ago
Alternatives and similar repositories for audiocraft
Users that are interested in audiocraft are comparing it to the libraries listed below
Sorting:
- 🔊 Text-Prompted Generative Audio Model☆38,371Updated 11 months ago
- StableLM: Stability AI Language Models☆15,825Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,910Updated last year
- Stable diffusion for real-time music generation☆3,773Updated last year
- ☆7,843Updated last year
- LLM UI with advanced features, easy setup, and multiple backend support.☆44,681Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,440Updated 11 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,627Updated 9 months ago
- Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch☆3,275Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,190Updated last year
- Community interface for generative AI☆9,023Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,745Updated 11 months ago
- Official implementation of AnimateDiff.☆11,663Updated last year
- Text-to-Audio/Music Generation☆2,482Updated 10 months ago
- A latent text-to-image diffusion model☆71,331Updated last year
- AudioLDM: Generate speech, sound effects, music and beyond, with text.☆2,723Updated last month
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,983Updated 2 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,088Updated last year
- Code to accompany "A Method for Animating Children's Drawings of the Human Figure"☆12,655Updated 3 months ago
- Generative models for conditional audio generation☆3,396Updated last month
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,444Updated 2 months ago
- Inference code for CodeLlama models☆16,354Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆30,271Updated this week
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆25,702Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆86,321Updated last month
- Generative Models by Stability AI☆26,269Updated 2 months ago
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.☆3,759Updated last year
- A multi-voice TTS system trained with an emphasis on quality☆14,494Updated 8 months ago
- Universal LLM Deployment Engine with ML Compilation☆21,099Updated this week
- Inference code for Llama models☆58,621Updated 6 months ago