SkalskiP / segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
☆12 Updated last year
Alternatives and similar repositories for segment-anything-2
Users who are interested in segment-anything-2 are comparing it to the repositories listed below.
- ☆17 Updated last year
- Notebooks using the Neural Magic libraries ☆39 Updated last year
- Awesome LLM application repo ☆87 Updated 9 months ago
- YOLOv10: Real-Time End-to-End Object Detection ☆11 Updated last year
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated last year
- ☆22 Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 ☆16 Updated last year
- YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smooth… ☆58 Updated last year
- This template demonstrates how to create a collaborative team of AI agents that work together to process, analyze, and generate insights … ☆49 Updated 11 months ago
- ☆15 Updated 2 years ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆65 Updated 2 years ago
- ☆22 Updated last year
- Eye exploration ☆31 Updated 3 weeks ago
- Multimodal AI App using Llava 7B and Gradio. ☆39 Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆20 Updated 2 months ago
- CrewAI AgentOps: Monitor your AI Agents ☆19 Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆49 Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆93 Updated last week
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆162 Updated 4 months ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆21 Updated last year
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag… ☆118 Updated 2 years ago
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM. ☆19 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆23 Updated last year
- Passively collect images for computer vision datasets on the edge. ☆35 Updated 2 years ago
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language. ☆29 Updated 10 months ago
- An agent to generate stunning images :) ☆23 Updated 7 months ago
- Get the information of a Github Repository using the power of LLM. ☆53 Updated 2 years ago
- ☆13 Updated last year
- GPT-4V(ision) module for use with Autodistill. ☆25 Updated last year
- Agentic RAG using Crew AI. ☆30 Updated last year