SkalskiP / segment-anything-2Links
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
☆13Updated last year
Alternatives and similar repositories for segment-anything-2
Users that are interested in segment-anything-2 are comparing it to the libraries listed below
Sorting:
- ☆16Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16Updated last year
- Passively collect images for computer vision datasets on the edge.☆35Updated last year
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Updated last year
- Notebooks using the Neural Magic libraries 📓☆40Updated last year
- Real-Time Open-Vocabulary Object Detection☆13Updated last year
- ☆22Updated last year
- Eye exploration☆28Updated 6 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆18Updated last week
- YOLOv10: Real-Time End-to-End Object Detection☆11Updated last year
- Multimodal AI App using Llava 7B and Gradio.☆40Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆47Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- ☆21Updated 9 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 11 months ago
- Simple CogVLM client script☆14Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated 10 months ago
- ☆20Updated 7 months ago
- ☆16Updated 10 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 7 months ago
- ☆12Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated 2 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆87Updated this week
- A collection of apps powered by the LlamaIndex LLM framework.☆55Updated 5 months ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆133Updated 3 weeks ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- GGUF Quantization of any LLM.☆40Updated last year
- 100 Days of GPU Challenge☆21Updated 2 months ago
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆117Updated 2 years ago
- Awesome LLM application repo☆86Updated 5 months ago