motern88 / HiLightLinks
Video Language Model for Motern AI
☆9Updated 7 months ago
Alternatives and similar repositories for HiLight
Users that are interested in HiLight are comparing it to the libraries listed below
Sorting:
- official repository for ATM-Traffic☆10Updated last month
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆26Updated 11 months ago
- [CVPR 25] Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"☆64Updated last month
- ☆17Updated 7 months ago
- This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acceleration☆14Updated 2 months ago
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆13Updated 4 months ago
- We use Raspberry PI and STM32 to build a pet emotion recognition balance robot, which has excellent movement ability and robust emotion r…☆10Updated 5 months ago
- AFFNet-Unofficial Implementation☆15Updated last year
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆64Updated 3 weeks ago
- [NAACL 2025 Main Conference] PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization☆18Updated 2 months ago
- Official repository of MLLA (NeurIPS 2024)☆328Updated 6 months ago
- (AAAI 2025) Official PyTorch implementation of paper "SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection".☆14Updated 3 weeks ago
- ☆26Updated 6 months ago
- Hybrid Mamba for Few-Shot Segmentation (NIPS 2024)☆29Updated 8 months ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆69Updated 5 months ago
- A novel spatial multi-modal omics framework, named PRototype-Aware Graph Adaptative aggregation (PRAGA) for spatial multi-modal omics ana…☆10Updated last week
- Source code for AAAI 2025 paper: FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks☆25Updated 3 months ago
- A curasted list of papers with the topic of Diffusion Models for Multi-Modal☆28Updated last year
- [CVPR 2024] MFP: Making Full Use of Probability Maps for Interactive Image Segmentation☆15Updated 10 months ago
- Official implementation for P2SAM (ACM MM 2024)☆10Updated 6 months ago
- SWUFE 西南财经大学 LaTeX 本科毕业论文模板,适用于 Overleaf☆10Updated last month
- A Comprehensive Survey on Knowledge Distillation☆36Updated 2 months ago
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…☆17Updated 3 months ago
- New generation of CLIP with fine grained discrimination capability, ICML2025☆180Updated 2 weeks ago
- Robust End-to-end Point-Supervised Tiny Object Detection☆8Updated last month
- A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).☆17Updated 3 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆91Updated 3 weeks ago
- ☆30Updated 2 months ago
- This is a laboratory code of paper---MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion☆23Updated 9 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆141Updated 3 months ago