LeanFly / Grounded-Segment-Anything-API
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs
☆20 Updated last year
Alternatives and similar repositories for Grounded-Segment-Anything-API:
Users who are interested in Grounded-Segment-Anything-API are comparing it to the libraries listed below:
- Implementation of Grounding DINO & Segment Anything that allows prompt-based masking, which is useful for programmed inpainting. ☆34 Updated last year
- Gradio UI for a Cog API ☆65 Updated 9 months ago
- ☆29 Updated last year
- Seamless Voice Interactions with LLMs ☆11 Updated last year
- Build Agentic workflows with function calling ☆26 Updated this week
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated 3 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆32 Updated 2 weeks ago
- ☆20 Updated 10 months ago
- ☆19 Updated last year
- Notebooks using the Neural Magic libraries 📓 ☆41 Updated 5 months ago
- [WIP] AI Try-On plugin for Chrome ☆26 Updated 10 months ago
- Generate visual podcasts about novels using open source models ☆24 Updated last year
- ☆40 Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆44 Updated 4 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆64 Updated last year
- ☆30 Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆79 Updated 7 months ago
- ☆16 Updated 11 months ago
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated 10 months ago
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam ☆27 Updated 9 months ago
- A function to do all ☆35 Updated 9 months ago
- Transcribe and summarize videos using Whisper and LLMs on the Apple MLX framework ☆73 Updated 11 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore ☆39 Updated last month
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search ☆40 Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆81 Updated 3 weeks ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆23 Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆45 Updated 10 months ago
- ☆76 Updated 9 months ago
- ☆14 Updated last year
- Gradio-based tool to run open-source LLMs directly from Hugging Face ☆90 Updated 6 months ago