OpenGVLab / InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
☆3,191Updated 3 weeks ago
Related projects: ⓘ
- [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.☆2,977Updated 2 weeks ago
- Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功…☆4,994Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,465Updated 2 months ago
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)☆3,277Updated 6 months ago
- Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold☆2,162Updated last year
- [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation☆4,198Updated 10 months ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators☆3,973Updated last year
- ☆7,642Updated 5 months ago
- ImageBind One Embedding Space to Bind Them All☆8,221Updated last month
- Inpaint anything using Segment Anything and inpainting models.☆6,305Updated 6 months ago
- Official repo for consistency models.☆6,073Updated 5 months ago
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,155Updated 6 months ago
- Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI…☆6,407Updated 3 months ago
- [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding☆2,708Updated 3 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆8,881Updated last month
- Open source short video automatic generation tool☆2,660Updated last year
- Open-source and strong foundation image recognition models.☆2,745Updated last month
- Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model☆3,200Updated 7 months ago
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"☆4,309Updated last month
- Instruction Tuning with GPT-4☆4,165Updated last year
- Community interface for generative AI☆8,677Updated 4 months ago
- Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with dive…☆1,662Updated last year
- Official implementation of AnimateDiff.☆10,270Updated last month
- ModelScope: bring the notion of Model-as-a-Service to life.☆6,794Updated this week
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,691Updated 6 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆6,960Updated 5 months ago
- T2I-Adapter☆3,402Updated 2 months ago
- An open-source framework for training large multimodal models.☆3,658Updated 2 weeks ago
- FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.☆3,813Updated 4 months ago
- ☆3,445Updated 4 months ago