open-mmlab / mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
☆4,726 · Updated last year
Alternatives and similar repositories for mmaction2
Users who are interested in mmaction2 compare it with the libraries listed below.
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models. ☆7,070 · Updated 8 months ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding ☆2,134 · Updated last year
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?" ☆1,744 · Updated last year
- This is an official implementation for "Video Swin Transformers". ☆1,573 · Updated 2 years ago
- A toolbox for skeleton-based action recognition. ☆1,135 · Updated 5 months ago
- OpenMMLab Computer Vision Foundation ☆6,223 · Updated 4 months ago
- A deep learning library for video understanding research. ☆3,467 · Updated 7 months ago
- OpenMMLab Pre-training Toolbox and Benchmark ☆3,734 · Updated 9 months ago
- An open-source toolbox for action understanding based on PyTorch ☆1,873 · Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training ☆1,558 · Updated last year
- ☆875 · Updated last year
- OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT… ☆3,779 · Updated last year
- 3D ResNets for Action Recognition (CVPR 2018) ☆4,011 · Updated 4 years ago
- OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc. ☆3,271 · Updated last year
- Deformable DETR: Deformable Transformers for End-to-End Object Detection. ☆3,674 · Updated last year
- OpenMMLab Pose Estimation Toolbox and Benchmark. ☆6,775 · Updated 3 weeks ago
- Scenic: A Jax Library for Computer Vision Research and Beyond ☆3,638 · Updated 2 weeks ago
- An OpenMMLab toolbox for human pose estimation, skeleton-based action recognition, and action synthesis. ☆3,031 · Updated 2 years ago
- Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based a… ☆1,636 · Updated 6 months ago
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks. ☆2,202 · Updated last year
- Visual tracking library based on PyTorch. ☆3,412 · Updated last year
- [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection" ☆2,598 · Updated last year
- PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition. ☆1,224 · Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and … ☆1,889 · Updated 2 years ago
- You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization ☆894 · Updated 9 months ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding ☆2,013 · Updated 2 weeks ago
- [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions ☆2,718 · Updated 5 months ago
- [ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box ☆5,539 · Updated last year
- VideoX: a collection of video cross-modal models ☆1,038 · Updated last year
- A curated list of action recognition and related area resources ☆3,925 · Updated 2 years ago