Event-AHU / SAFE_LargeVLM
[Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandong Jin, Yuhao Zhang, Yanlin Zhong, Yaoyang Wu, Lan Chen, Xiao Wang, Bin Luo
☆17 Updated last week
Related projects
Alternatives and complementary repositories for SAFE_LargeVLM
- ☆24 Updated last year
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories ☆27 Updated 3 weeks ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding" ☆16 Updated this week
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆35 Updated 5 months ago
- ☆11 Updated last year
- Visual RAG using less than 300 lines of code. ☆23 Updated 8 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling ☆30 Updated 7 months ago
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024) ☆30 Updated 6 months ago
- AICSD: Adaptive Inter-Class Similarity Distillation for Semantic Segmentation ☆24 Updated 2 months ago
- Neural network for creating distortion while keeping embeddings as close as possible ☆18 Updated 9 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆49 Updated last month
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" ☆15 Updated 2 weeks ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models" ☆35 Updated 10 months ago
- Open-source Python toolkit focused on deep learning with ordinal methodologies ☆31 Updated last week
- A plug-and-play pipeline that utilizes segment anything to segment datasets with rich detail for downstream fine-tuning on vision mod… ☆21 Updated 8 months ago
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files ☆26 Updated last year
- ☆25 Updated 2 months ago
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient ☆30 Updated 11 months ago
- ☆25 Updated last year
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio… ☆31 Updated this week
- ☆33 Updated 9 months ago
- Multiple Transformation Function Estimation for Image Enhancement ☆22 Updated 3 weeks ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States" ☆23 Updated this week
- ☆41 Updated last month
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts" ☆23 Updated last month
- A Data Source for Reasoning Embodied Agents ☆19 Updated last year
- [IJCAI'23] Complete Instances Mining for Weakly Supervised Instance Segmentation ☆37 Updated 8 months ago
- ☆12 Updated 2 months ago
- ☆14 Updated 8 months ago
- LiVOS: Light Video Object Segmentation with Gated Linear Matching ☆10 Updated this week