AIGC-Audio / AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
☆10,210 Updated last year
Alternatives and similar repositories for AudioGPT
Users who are interested in AudioGPT are comparing it to the repositories listed below.
- ☆7,843 Updated last year
- 🔊 Text-Prompted Generative Audio Model ☆38,767 Updated last year
- PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn … ☆7,161 Updated 9 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆22,719 Updated 8 months ago
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,469 Updated 5 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,646 Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ☆25,758 Updated last year
- ChatGPT interface with better UI ☆3,538 Updated last year
- InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBin… ☆3,218 Updated last year
- 🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation w… ☆6,208 Updated last year
- Plugins for Auto-GPT ☆3,869 Updated last year
- Community interface for generative AI ☆9,036 Updated last year
- Home of StarCoder: fine-tuning & inference! ☆7,474 Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,714 Updated last year
- StableLM: Stability AI Language Models ☆15,787 Updated last year
- Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI… ☆6,887 Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models ☆5,000 Updated last year
- Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. D… ☆11,976 Updated last month
- AudioLDM: Generate speech, sound effects, music and beyond, with text. ☆2,775 Updated 5 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ ☆7,972 Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,236 Updated last year
- Open source short video automatic generation tool ☆2,801 Updated 2 years ago
- An unofficial PyTorch implementation of the audio LM VALL-E ☆2,992 Updated 2 years ago
- MiniAGI is a simple general-purpose AI agent based on the OpenAI API. ☆2,903 Updated 2 years ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset ☆7,528 Updated 2 years ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators ☆4,224 Updated 2 years ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,287 Updated 6 months ago
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. ☆35,286 Updated 7 months ago
- Instruct-tune LLaMA on consumer hardware ☆18,983 Updated last year
- Text-to-Audio/Music Generation ☆2,524 Updated last year