huggingface / segment-anything-2Links
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
☆73Updated 8 months ago
Alternatives and similar repositories for segment-anything-2
Users that are interested in segment-anything-2 are comparing it to the libraries listed below
Sorting:
- SmolVLM2 Demo☆154Updated 3 months ago
- mlx image models for Apple Silicon machines☆80Updated 2 months ago
- ☆351Updated 8 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 8 months ago
- Swift implementation of Flux.1 using mlx-swift☆84Updated 6 months ago
- CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A…☆79Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆96Updated 6 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆174Updated 3 weeks ago
- ☆48Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 9 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆72Updated last week
- Notebooks to demonstrate TimmWrapper☆16Updated 5 months ago
- Examples using MLX Swift☆13Updated 2 months ago
- Swift Core ML Examples☆221Updated 6 months ago
- Montelimar - Extract text from anywhere☆78Updated last month
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆47Updated 9 months ago
- Open-source and reproducible benchmarks for Speaker Diarization☆27Updated this week
- Gradio app to track objects in video and add visual effects☆16Updated last month
- ☆22Updated 8 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 7 months ago
- run embeddings in MLX☆90Updated 8 months ago
- Run large models from the terminal using Apple MLX.☆30Updated last year
- Gradio UI for a Cog API☆68Updated last year
- A Gradio component that can be used to annotate images with bounding boxes.☆54Updated last week
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- ☆29Updated last month
- ☆58Updated last year
- Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments☆41Updated 4 months ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images☆76Updated 3 years ago