AhmedZgaren / Save
☆29 Updated 5 months ago
Alternatives and similar repositories for Save
Users who are interested in Save are comparing it to the repositories listed below
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models ☆35 Updated 3 months ago
- EMOv2: Pushing 5M Vision Model Frontier ☆46 Updated 6 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆125 Updated 11 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling ☆31 Updated 8 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆56 Updated last year
- Focusing on Tracks for Online Multi-Object Tracking ☆51 Updated 2 weeks ago
- ☆68 Updated last year
- 3D Traffic Light & Sign Dataset ☆19 Updated 3 months ago
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking" ☆97 Updated last month
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" ☆288 Updated 2 weeks ago
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model ☆105 Updated last week
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆40 Updated last year
- Make Your Training Flexible: Towards Deployment-Efficient Video Models ☆30 Updated last month
- XmodelLM ☆39 Updated 7 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V… ☆118 Updated last month
- CVPR 2025 Workshop on CVEU. ☆41 Updated last month
- EdgeSAM model for use with Autodistill. ☆27 Updated last year
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning. ☆16 Updated 4 months ago
- ☆23 Updated 9 months ago
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos. ☆49 Updated last week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆64 Updated 11 months ago
- Efficient Track Anything ☆586 Updated 6 months ago
- ☆191 Updated last month
- ☆19 Updated 2 months ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight) ☆355 Updated 10 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024 ☆60 Updated 4 months ago
- ☆41 Updated 5 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆140 Updated 3 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories ☆27 Updated 3 months ago
- Take your LLM to the optometrist. ☆33 Updated last week