facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
☆21,325Updated this week
Alternatives and similar repositories for audiocraft:
Users that are interested in audiocraft are comparing it to the libraries listed below
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,156Updated 2 months ago
- Inference code for CodeLlama models☆16,150Updated 5 months ago
- Universal LLM Deployment Engine with ML Compilation☆19,630Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆27,123Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆37,496Updated this week
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf☆23,871Updated 3 months ago
- ☆20,808Updated 2 months ago
- ImageBind One Embedding Space to Bind Them All☆8,476Updated 5 months ago
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,071Updated 6 months ago
- Let us control diffusion models!☆31,207Updated 10 months ago
- Community interface for generative AI☆8,896Updated 8 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆63,655Updated this week
- Generative Models by Stability AI☆25,088Updated 4 months ago
- Official implementation of AnimateDiff.☆10,863Updated 5 months ago
- 🔊 Text-Prompted Generative Audio Model☆36,678Updated 4 months ago
- Inference code for Llama models☆57,227Updated 4 months ago
- StableLM: Stability AI Language Models☆15,831Updated 9 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆38,057Updated this week
- Making large AI models cheaper, faster and more accessible☆39,013Updated last week
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,536Updated 4 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,751Updated 11 months ago
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆28,480Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,688Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆13,023Updated 3 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,096Updated 5 months ago
- Generate 3D objects conditioned on text or images☆11,757Updated 6 months ago
- Stable diffusion for real-time music generation☆3,469Updated 5 months ago
- ☆34,540Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models☆39,774Updated 3 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,216Updated 6 months ago