AhmedZgaren / SaveLinks
☆31Updated last week
Alternatives and similar repositories for Save
Users that are interested in Save are comparing it to the libraries listed below
Sorting:
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models☆36Updated 6 months ago
- [T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier☆49Updated 9 months ago
- 3D Traffic Light & Sign Dataset☆20Updated 6 months ago
- ☆70Updated last year
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Updated 11 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- CVPR 2025 Workshop on CVEU.☆42Updated 3 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆132Updated last year
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆40Updated last year
- ☆26Updated 11 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆58Updated last year
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆162Updated 2 weeks ago
- Multiple Transformation Function Estimation for Image Enhancement☆22Updated 11 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆124Updated 4 months ago
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆327Updated 2 weeks ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆50Updated last year
- Efficient Track Anything☆650Updated 9 months ago
- TensorFlow implementation of a comprehensive comparison of various SSL (Semi-Supervised Learning) approaches in image segmentation, featu…☆19Updated 11 months ago
- Make Your Training Flexible: Towards Deployment-Efficient Video Models☆29Updated 4 months ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆62Updated 2 weeks ago
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆14Updated 8 months ago
- XmodelLM☆39Updated 10 months ago
- ☆192Updated 4 months ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆360Updated last year
- Focusing on Tracks for Online Multi-Object Tracking☆75Updated 2 weeks ago
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆120Updated 2 months ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated last year
- [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆395Updated 3 weeks ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆58Updated 9 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆59Updated 7 months ago