vrizawahyu22 / juggling_counting
☆60 Updated last year
Related projects
Alternatives and complementary repositories for juggling_counting
- Eye exploration ☆22 Updated this week
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆65 Updated 11 months ago
- Vehicle speed estimation using YOLOv8 ☆30 Updated 7 months ago
- ☆13 Updated 11 months ago
- Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag… ☆100 Updated last year
- EdgeSAM model for use with Autodistill. ☆25 Updated 5 months ago
- Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV, stick around! ☆148 Updated last year
- ☆27 Updated 10 months ago
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi… ☆20 Updated 8 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆77 Updated last year
- Flask-based web application designed to compare text and image embeddings using the CLIP model. ☆22 Updated 9 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆59 Updated 3 months ago
- YOLOExplorer: Iterate on your YOLO / CV datasets using SQL, vector semantic search, and more within seconds ☆124 Updated 2 weeks ago
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation ☆45 Updated last month
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm… ☆119 Updated this week
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆32 Updated 2 months ago
- ☆30 Updated 9 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆34 Updated last year
- ☆45 Updated 9 months ago
- ☆66 Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆47 Updated this week
- An optimized, production-ready implementation of active speaker detection ☆54 Updated 5 months ago
- Simple CogVLM client script ☆14 Updated 11 months ago
- Notebooks using the Neural Magic libraries 📓 ☆41 Updated 3 months ago
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 ☆52 Updated last month
- Computer Vision Helping Library ☆12 Updated 2 weeks ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆92 Updated 3 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆65 Updated 6 months ago
- ☆29 Updated last month
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated last month