AhmedZgaren / Save
☆27 Updated 3 months ago
Alternatives and similar repositories for Save
Users who are interested in Save are comparing it to the libraries listed below.
- 3D Traffic Light & Sign Dataset ☆18 Updated last month
- EMOv2: Pushing 5M Vision Model Frontier ☆46 Updated 4 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024 ☆60 Updated 2 months ago
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models ☆32 Updated last month
- ☆68 Updated 10 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆123 Updated 9 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference. ☆27 Updated 8 months ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆55 Updated 5 months ago
- Make Your Training Flexible: Towards Deployment-Efficient Video Models ☆27 Updated 2 weeks ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆39 Updated 11 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories ☆27 Updated last month
- CVPR 2025 Workshop on CVEU. ☆39 Updated last month
- Official Pytorch Implementation of Self-emerging Token Labeling ☆33 Updated last year
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆122 Updated 7 months ago
- Multiple Transformation Function Estimation for Image Enhancement ☆22 Updated 6 months ago
- EdgeSAM model for use with Autodistill. ☆26 Updated 11 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling ☆31 Updated 6 months ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis ☆104 Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 Updated 9 months ago
- ☆25 Updated 2 months ago
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding ☆14 Updated 3 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆63 Updated 9 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆56 Updated last year
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient ☆30 Updated last year
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries". ☆56 Updated last year
- ☆24 Updated last year
- LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025) ☆34 Updated last month
- Codebase for the Recognize Anything Model (RAM) ☆78 Updated last year
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆59 Updated 2 months ago
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning. ☆15 Updated 2 months ago