Vision-CAIR / MammalNet
☆39 Updated 7 months ago
Alternatives and similar repositories for MammalNet
Users who are interested in MammalNet are comparing it to the repositories listed below.
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv… ☆134 Updated 2 years ago
- [CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding ☆147 Updated 9 months ago
- ☆43 Updated 3 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23] ☆24 Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection ☆67 Updated 2 years ago
- ☆18 Updated last month
- A curated list of awesome temporal action segmentation resources. ☆216 Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio… ☆74 Updated 3 months ago
- ☆11 Updated 4 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆100 Updated last year
- A curated list of awesome self-supervised learning methods in videos ☆152 Updated last month
- [ICCV 2023] Rethinking pose estimation in crowds: overcoming the detection information-bottleneck and ambiguity ☆99 Updated last year
- The suite of modeling video with Mamba ☆278 Updated last year
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition" ☆52 Updated last year
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models ☆18 Updated last year
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners". ☆291 Updated last year
- Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization" ☆20 Updated last year
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆92 Updated last year
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition" ☆73 Updated 7 months ago
- The official implementation of our paper "Sports Video Analysis on Large-scale Data" (https://arxiv.org/abs/2208.04897) ☆78 Updated 2 years ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆100 Updated last year
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆27 Updated 7 months ago
- [ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training ☆134 Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023) ☆66 Updated 2 years ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living ☆27 Updated 7 months ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022 ☆149 Updated 2 years ago
- This repo contains the evaluation code for the INQUIRE benchmark ☆53 Updated 9 months ago
- ☆40 Updated last year
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos ☆27 Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆47 Updated last year