Event-AHU / VFM-Det
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
☆29Updated this week
Alternatives and similar repositories for VFM-Det:
Users that are interested in VFM-Det are comparing it to the libraries listed below
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆17Updated last month
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆34Updated 9 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆28Updated 3 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆18Updated 3 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Updated 4 months ago
- Public repository for the ECCV 2024 paper "Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation".☆22Updated 4 months ago
- LiVOS: Light Video Object Segmentation with Gated Linear Matching☆25Updated 3 months ago
- Wonderful Matrices to Build Small Language Models☆44Updated last week
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 10 months ago
- ☆24Updated last year
- This repository contains the official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark".☆25Updated 3 weeks ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆40Updated last month
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 6 months ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆36Updated this week
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆31Updated 3 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆19Updated 3 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆37Updated 9 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆49Updated 3 weeks ago
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆13Updated last month
- XmodelLM☆39Updated 3 months ago
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".☆51Updated 9 months ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆41Updated last month
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆33Updated 7 months ago
- Adaptive Inter-Class Similarity Distillation for Semantic Segmentation (MTAP 2025)☆25Updated last week
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues☆51Updated 2 months ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Updated 9 months ago
- Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA top 1-acc b…☆18Updated 2 months ago
- EMOv2: Pushing 5M Vision Model Frontier☆43Updated last month