Vision-CAIR / MammalNetLinks
☆36Updated 4 months ago
Alternatives and similar repositories for MammalNet
Users that are interested in MammalNet are comparing it to the libraries listed below
Sorting:
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆132Updated 2 years ago
- [CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding☆143Updated 6 months ago
- Scene and animal attribute retrieval from camera trap data with domain-adapted vision-language models☆23Updated last year
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- ☆11Updated 4 years ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆81Updated 5 months ago
- ☆37Updated 3 months ago
- [ICCV 2023] Rethinking pose estimation in crowds: overcoming the detection information-bottleneck and ambiguity☆97Updated last year
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆33Updated 6 months ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Updated last year
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆61Updated 9 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆59Updated 4 months ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Updated last year
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆41Updated 7 months ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆64Updated last year
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆27Updated 4 months ago
- The extension of this dataset (APTv2) can be found at:☆47Updated last year
- BEAR: a new BEnchmark on video Action Recognition☆44Updated last year
- Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight☆45Updated last year
- This repo contains the evaluation code for the INQUIRE benchmark☆48Updated 6 months ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Updated last year
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23☆27Updated 5 months ago
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"☆81Updated 11 months ago
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆39Updated last year
- A curated list of awesome self-supervised learning methods in videos☆142Updated last month
- ☆18Updated last year
- Official PyTorch repository for GRAM☆73Updated last month
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Updated last year
- Official PyTorch code for the CVPR 2024 paper 'Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognitio…☆32Updated last month
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos☆27Updated 6 months ago