LiWentomng / gradio-osprey-demoLinks
Gradio demo used in our Osprey:Pixel Understanding with Visual Instruction Tuning.
☆16Updated 2 years ago
Alternatives and similar repositories for gradio-osprey-demo
Users that are interested in gradio-osprey-demo are comparing it to the libraries listed below
Sorting:
- ☆20Updated 2 years ago
- Codebase for the Recognize Anything Model (RAM)☆88Updated 2 years ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆99Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆109Updated last month
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning☆131Updated 6 months ago
- ☆198Updated 7 months ago
- WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens☆201Updated last year
- ☆34Updated last year
- ☆71Updated 2 years ago
- Image Editing Anything☆116Updated 2 years ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆53Updated 8 months ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆20Updated 3 years ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Updated last year
- Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).☆159Updated last year
- Image Prompter for Gradio☆92Updated 2 years ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆128Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated last year
- Adobe-EntitySeg dataset☆43Updated 2 years ago
- Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning☆111Updated 7 months ago
- ☆58Updated 2 years ago
- ☆94Updated 10 months ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated 2 years ago
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆64Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆108Updated last year
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated last year
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆85Updated 2 years ago
- ☆36Updated last year
- Precision Search through Multi-Style Inputs☆73Updated 5 months ago
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆120Updated 3 months ago