binbinjiang / CVT-SLRLinks
Official code of CVPR 2023 Highlight paper CVT-SLR
☆179Updated last year
Alternatives and similar repositories for CVT-SLR
Users that are interested in CVT-SLR are comparing it to the libraries listed below
Sorting:
- Latest AI Sign Language Papers & Survey & Review☆123Updated 11 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆129Updated last year
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆96Updated 4 months ago
- Code for ICCV 2025 paper - Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-T…☆96Updated 3 weeks ago
- [NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild☆43Updated 7 months ago
- Voice-Face Association Learning Evaluation☆49Updated last year
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆79Updated 8 months ago
- This repository is the official implementation of GaTector, which studies the newly proposed task, gaze object prediction. In this work, …☆58Updated last year
- Official repository of Expert-Controlled Classifier-Free Guidance for Reliable Medical Visual Question Answering.☆41Updated last month
- ☆49Updated 5 years ago
- [ICLR'2025 Spotlight] Official repository for "SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding"☆69Updated 4 months ago
- ☆89Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆157Updated 10 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆113Updated 10 months ago
- [CVPR 2024] Interactive continual learning: Fast and slow thinking☆102Updated last year
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆116Updated last year
- PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise interaction☆99Updated 2 years ago
- ☆99Updated 2 years ago
- DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆70Updated 3 months ago
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆26Updated last year
- This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…☆92Updated last year
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.☆123Updated 2 months ago
- The official implementation of BackTAL, TPAMI 2021.☆167Updated 3 years ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆121Updated last week
- IGANet, single-frame based 3D human pose estimation☆51Updated 2 years ago
- Official Implementation of NeurIPS 2023 Contextually Affinitive Neighborhood Refinery for Deep Clustering☆46Updated last year
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆34Updated 2 years ago
- [ICLR 25] The implementation of paper Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision.☆52Updated last month
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆49Updated 6 months ago
- This is the source code for paper "Unsupervised Adversarial Domain Adaptation for Cross-domain Face Presentation Attack Detection"☆77Updated 4 years ago