binbinjiang / CVT-SLRLinks
Official code of CVPR 2023 Highlight paper CVT-SLR
☆179Updated last year
Alternatives and similar repositories for CVT-SLR
Users that are interested in CVT-SLR are comparing it to the libraries listed below
Sorting:
- Latest AI Sign Language Papers & Survey & Review☆120Updated 10 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆128Updated 11 months ago
- Official repository of Expert-Controlled Classifier-Free Guidance for Reliable Medical Visual Question Answering.☆40Updated 2 weeks ago
- [ICLR'2025 Spotlight] Official repository for "SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding"☆69Updated 3 months ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆81Updated 7 months ago
- ☆49Updated 5 years ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆98Updated 4 months ago
- [NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild☆43Updated 7 months ago
- ☆100Updated last year
- Voice-Face Association Learning Evaluation☆48Updated last year
- DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆70Updated 2 months ago
- ☆89Updated last year
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆116Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆158Updated 9 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆111Updated 9 months ago
- ☆45Updated 3 months ago
- Code for ICCV 2025 paper - Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-T…☆96Updated 2 weeks ago
- [CVPR 2024] Interactive continual learning: Fast and slow thinking☆102Updated last year
- PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise interaction☆100Updated 2 years ago
- This repository is the official implementation of GaTector, which studies the newly proposed task, gaze object prediction. In this work, …☆58Updated last year
- The official implementation of BackTAL, TPAMI 2021.☆168Updated 3 years ago
- ☆84Updated 9 months ago
- (NeurIPS‘24) LLM4EA: <Entity Alignment with Noisy Annotations from Large Language Models>☆57Updated 6 months ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆49Updated 5 months ago
- linkedin, seek job information crawler☆105Updated 3 months ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆310Updated 11 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆199Updated last month
- This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…☆93Updated last year
- ☆163Updated last month
- A collection of papers related to knowledge fusion☆57Updated 9 months ago