zahid-isu / DriveCLIPLinks
☆12Updated 5 months ago
Alternatives and similar repositories for DriveCLIP
Users that are interested in DriveCLIP are comparing it to the libraries listed below
Sorting:
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder…☆35Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆69Updated last year
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆52Updated last year
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆32Updated 2 years ago
- Frame Flexible Network (CVPR2023)☆56Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆100Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆88Updated 8 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆134Updated 2 years ago
- [ICCV2023] AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception☆44Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆46Updated last year
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆84Updated 2 weeks ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆67Updated 2 years ago
- Code for our CVPR 2021 paper Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection☆29Updated 4 years ago
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"☆85Updated last year
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization☆57Updated 2 years ago
- BEAR: a new BEnchmark on video Action Recognition☆44Updated last year
- GroupFormer☆55Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆92Updated last year
- Memory-augmented Online Video Anomaly Detection☆16Updated last year
- [CVPR'22] Official PyTorch implementation for paper "Efficient Two-Stage Detection of Human–Object Interactions with a Novel Unary–Pairwi…☆164Updated 2 years ago
- Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability☆28Updated last year
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation☆88Updated 2 years ago
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆58Updated 5 months ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆65Updated 9 months ago
- Video Feature Enhancement with PyTorch☆30Updated 9 months ago
- Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.☆41Updated 11 months ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting☆115Updated last year
- ☆19Updated 4 months ago