LeanFly / Grounded-Segment-Anything-API
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs
☆19 Updated last year
Related projects
Alternatives and complementary repositories for Grounded-Segment-Anything-API
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting. ☆34 Updated last year
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆31 Updated last month
- ☆29 Updated 11 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆65 Updated 11 months ago
- ☆30 Updated 10 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆45 Updated 7 months ago
- Visual RAG using less than 300 lines of code. ☆23 Updated 8 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆23 Updated 10 months ago
- ☆19 Updated last year
- Fine-tuning OpenAI CLIP Model for Image Search on medical images ☆74 Updated 2 years ago
- Finetune any model on HF in less than 30 seconds ☆56 Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆77 Updated 5 months ago
- ☆40 Updated 7 months ago
- Luann allows you to create an LLM agent, which has a complete memory module (long-term memory, short-term memory) and knowledge module (Variou… ☆16 Updated last week
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2 ☆20 Updated 2 months ago
- Gradio UI for a Cog API ☆64 Updated 7 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip. ☆15 Updated 9 months ago
- Super simple Streamlit app for playing with Stable Diffusion 2 and Stable Diffusion XL 1.0 ☆24 Updated 2 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted! ☆17 Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 9 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil… ☆28 Updated 10 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated last month
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆22 Updated 10 months ago
- ☆18 Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- ☆15 Updated last year
- ☆28 Updated 10 months ago
- ☆12 Updated 10 months ago
- Notebooks using the Neural Magic libraries 📓 ☆41 Updated 3 months ago
- BH hackathon ☆14 Updated 7 months ago