hasibzunair / peekaboo2Links
Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.
β29Updated 3 weeks ago
Alternatives and similar repositories for peekaboo2
Users that are interested in peekaboo2 are comparing it to the libraries listed below
Sorting:
- Let's bake an image.β15Updated last week
- Inference and fine-tuning examples for vision models from π€ Transformersβ162Updated 3 months ago
- Using the moondream VLM with optical flow for promptable object trackingβ71Updated 9 months ago
- Take your LLM to the optometrist.β42Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β84Updated last year
- Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!β149Updated 2 years ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.β92Updated last week
- β34Updated last year
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vβ¦β125Updated 5 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"β815Updated 2 weeks ago
- Practical Python exercises on classical computer vision and clean engineering practicesβ22Updated 7 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.β67Updated last year
- β107Updated 5 months ago
- βYOLOLite β lightweight YOLO in PyTorch. ONNX export + CPU inference (Raspberry Pi friendly).ββ40Updated last week
- Solving Computer Vision with AI agentsβ34Updated 4 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of modelsβ274Updated 4 months ago
- Paper Piano uses Python and OpenCV to detect key presses on a hand-drawn piano, translating them into digital notes and sound.β42Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.β65Updated 2 years ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.β46Updated 10 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!β79Updated last year
- β56Updated last year
- From scratch implementation of a vision language model in pure PyTorchβ251Updated last year
- EyeTrax β webcam-based eye tracking made simpleβ211Updated 2 months ago
- Salient feature extractor based on yoloV8β72Updated 2 years ago
- β120Updated 5 months ago
- Fine tune Gemma 3 on an object detection taskβ89Updated 4 months ago
- π | UniFace: A Comprehensive Library for Face Detection, Recognition, Landmark Analysis, Age, and Gender Detection.β246Updated 2 weeks ago
- Experiment and integrate with different OCR frameworks seamlesslyβ103Updated last year
- Notebooks for fine tuning pali gemmaβ117Updated 7 months ago
- Create topological graph for image segments.β22Updated last year