zahid-isu / DriveCLIP
☆12 · Updated 2 months ago
Alternatives and similar repositories for DriveCLIP
Users who are interested in DriveCLIP are comparing it to the repositories listed below.
- [ICCV2023] AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception ☆42 · Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition ☆28 · Updated 2 years ago
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition ☆32 · Updated 2 years ago
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024 ☆67 · Updated 7 months ago
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition" ☆52 · Updated last year
- BEAR: a new BEnchmark on video Action Recognition ☆44 · Updated last year
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition ☆63 · Updated 6 months ago
- Placeholder ☆10 · Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023) ☆46 · Updated 8 months ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points ☆45 · Updated last year
- ☆14 · Updated 2 years ago
- A new model for gait emotion recognition ☆13 · Updated last year
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23] ☆24 · Updated last year
- [CVPR2022] Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition ☆25 · Updated last year
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection ☆20 · Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition" ☆81 · Updated 5 months ago
- A simple but efficient transformer model for video action recognition ☆59 · Updated 2 years ago
- Frame Flexible Network (CVPR2023) ☆56 · Updated 2 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector ☆59 · Updated 2 years ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization ☆57 · Updated 2 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆89 · Updated 9 months ago
- Awesome Online Action Detection ☆62 · Updated 5 months ago
- A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition ☆26 · Updated 2 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆45 · Updated last year
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022] ☆38 · Updated last year
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning ☆39 · Updated last year
- ☆48 · Updated 2 years ago
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder… ☆30 · Updated 11 months ago
- Multi-Modal Multi-Action Video Recognition ☆7 · Updated 3 years ago
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions" ☆81 · Updated 11 months ago