The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring frequently.
☆345 · Feb 9, 2022 · Updated 4 years ago
Alternatives and similar repositories for ava-dataset
Users that are interested in ava-dataset are comparing it to the libraries listed below.
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral) ☆253 · Oct 19, 2019 · Updated 6 years ago
- Download the Google AVA dataset from within China ☆71 · Jun 6, 2022 · Updated 3 years ago
- This repository is intended to host tools and demos for ActivityNet ☆969 · Mar 21, 2024 · Updated 2 years ago
- Scripts for downloading the AVA (Atomic Visual Actions) dataset (https://research.google.com/ava/) and postprocessing it. ☆29 · May 2, 2019 · Updated 6 years ago
- Preprocessing tools for Google AVA Dataset ☆49 · Apr 27, 2018 · Updated 7 years ago
- Code for the Active Speakers in Context Paper (CVPR2020) ☆58 · May 19, 2021 · Updated 4 years ago
- Caffe: a fast open framework for deep learning. ☆106 · Feb 27, 2018 · Updated 8 years ago
- ☆20 · Updated this week
- Spatio-Temporal Action Localization System ☆425 · May 21, 2022 · Updated 3 years ago
- GPU implementation of improved dense trajectory ☆10 · Apr 14, 2015 · Updated 10 years ago
- You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization ☆902 · Oct 28, 2024 · Updated last year
- An open-source toolbox for action understanding based on PyTorch ☆1,875 · Apr 8, 2022 · Updated 3 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017 ☆88 · Dec 19, 2017 · Updated 8 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models. ☆7,321 · Mar 16, 2026 · Updated last week
- Convolutional neural network model for video classification trained on the Kinetics dataset. ☆1,825 · Sep 12, 2019 · Updated 6 years ago
- [CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization ☆214 · Oct 8, 2021 · Updated 4 years ago
- This repository hosts the code for the real-time action detection paper ☆320 · Feb 23, 2021 · Updated 5 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding ☆2,195 · Jul 11, 2024 · Updated last year
- Real-time Action detection demo for the work Actor Conditioned Attention Maps. This repo includes a complete pipeline for person detectio… ☆154 · Dec 8, 2022 · Updated 3 years ago
- Accepted by TMM 2022 ☆19 · Aug 18, 2022 · Updated 3 years ago
- ☆31 · Jan 6, 2019 · Updated 7 years ago
- Code for Oops! Predicting Unintentional Action in Video ☆80 · Apr 13, 2020 · Updated 5 years ago
- VMZ: Model Zoo for Video Modeling ☆1,053 · Jun 17, 2025 · Updated 9 months ago
- Non-local Neural Networks for Video Classification ☆1,993 · Sep 15, 2021 · Updated 4 years ago
- Code & Models for Temporal Segment Networks (TSN) in ECCV 2016 ☆1,578 · Oct 27, 2020 · Updated 5 years ago
- ☆21 · Nov 24, 2022 · Updated 3 years ago
- Temporal Relation Networks ☆791 · May 6, 2021 · Updated 4 years ago
- A curated list of action recognition and related area resources ☆3,991 · May 13, 2023 · Updated 2 years ago
- Long-Term Feature Banks for Detailed Video Understanding ☆384 · Aug 30, 2021 · Updated 4 years ago
- ☆952 · May 15, 2024 · Updated last year
- ☆30 · Apr 10, 2018 · Updated 7 years ago
- Temporal Segment Networks (TSN) in PyTorch ☆1,085 · Jun 21, 2019 · Updated 6 years ago
- Code for Temporal Relation Networks ☆24 · Dec 30, 2017 · Updated 8 years ago
- ☆83 · Feb 20, 2021 · Updated 5 years ago
- Mini-Kinetics-200 data splits used in paper "Rethinking Spatiotemporal Feature Learning For Video Understanding" ☆80 · Dec 24, 2017 · Updated 8 years ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection' ☆459 · Oct 23, 2023 · Updated 2 years ago
- Inflated I3D network with Inception backbone, weights transferred from TensorFlow ☆548 · May 23, 2024 · Updated last year
- Context-aware RCNN: a Baseline for Action Detection in Videos ☆51 · Oct 13, 2020 · Updated 5 years ago
- PyTorch implementation of "SlowFast Networks for Video Recognition". ☆351 · Mar 13, 2019 · Updated 7 years ago