AhmedZgaren / Save
☆30 Updated 6 months ago
Alternatives and similar repositories for Save
Users that are interested in Save are comparing it to the libraries listed below
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models ☆36 Updated 4 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆40 Updated last year
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling ☆31 Updated 9 months ago
- ☆69 Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆57 Updated last year
- 3D Traffic Light & Sign Dataset ☆19 Updated 5 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆67 Updated last year
- [T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier ☆48 Updated 8 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆127 Updated last year
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V… ☆124 Updated 2 months ago
- Efficient Track Anything ☆625 Updated 7 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆154 Updated 4 months ago
- [NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim ☆343 Updated 6 months ago
- CVPR 2025 Workshop on CVEU. ☆42 Updated 2 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" ☆309 Updated 2 months ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight) ☆358 Updated last year
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model ☆108 Updated 3 weeks ago
- CAVIS: Context-Aware Video Instance Segmentation ☆89 Updated last month
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆50 Updated 11 months ago
- Focusing on Tracks for Online Multi-Object Tracking ☆67 Updated last month
- Multiple Transformation Function Estimation for Image Enhancement ☆22 Updated 10 months ago
- ☆19 Updated 3 months ago
- ☆193 Updated 3 months ago
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking" ☆103 Updated 2 months ago
- [IROS24] Official Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Integrated into Nerfstudio ☆308 Updated 2 months ago
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding ☆14 Updated 6 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 Updated last year
- XmodelLM ☆39 Updated 9 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆77 Updated 2 months ago
- DETRPose: Real-time end-to-end transformer model for multi-person pose estimation ☆28 Updated 2 months ago