hulianyuyy / AdaptSignLinks
Improving Continuous Sign Language Recognition with Adapted Image Models
☆11Updated 2 weeks ago
Alternatives and similar repositories for AdaptSign
Users that are interested in AdaptSign are comparing it to the libraries listed below
Sorting:
- ☆64Updated last year
- ☆24Updated last year
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆36Updated last year
- ☆33Updated last year
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)☆29Updated last year
- CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation☆27Updated 7 months ago
- An implacation of SignGraph: A Sign Sequence is Worth Graphs of Nodes (CVPR2024)☆28Updated 9 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Updated 2 years ago
- ☆83Updated 2 years ago
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning☆68Updated last year
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Updated 10 months ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆47Updated last year
- [ICLR'25] Official Implement of "Uni-Sign: Toward Unified Sign Language Understanding at Scale"☆86Updated last month
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13Updated 2 years ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆74Updated 3 months ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆60Updated last year
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.☆19Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated 2 years ago
- ☆25Updated 6 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- 🌟 Code for ACL 2023 paper "GloFE: Gloss-Free End-to-End Sign Language Translation" (Oral)☆38Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023)☆66Updated 2 years ago
- Pytorch implementation for Egoinstructor at CVPR 2024☆24Updated 9 months ago
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆44Updated 10 months ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆123Updated 2 years ago
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆52Updated last year
- ☆40Updated last year
- [CVPR2022] MS-TCT☆54Updated 2 years ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval." CVPR 2022☆112Updated 3 years ago