This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection
☆96Apr 14, 2023Updated 3 years ago
Alternatives and similar repositories for tubelet-transformer
Users that are interested in tubelet-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆17Jun 17, 2023Updated 2 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆72Jan 9, 2025Updated last year
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆39Sep 27, 2023Updated 2 years ago
- download AVA dataset☆23Sep 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 3 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- [ECCV 2020] Actions as Moving Points☆272Dec 19, 2020Updated 5 years ago
- ☆113Nov 3, 2022Updated 3 years ago
- Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions☆134Jun 7, 2022Updated 3 years ago
- The second generation of YOWO action detector.☆287May 9, 2024Updated 2 years ago
- The code is for the CVPR 2019 paper 'Dance with Flow: Two-in-One Stream for Action Detection '☆32Nov 21, 2022Updated 3 years ago
- You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization☆911Oct 28, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Context-aware RCNN: a Baseline for Action Detection in Videos☆51Oct 13, 2020Updated 5 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆70Feb 3, 2023Updated 3 years ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆119Aug 23, 2025Updated 9 months ago
- Code release for ActionFormer (ECCV 2022)☆559Apr 11, 2024Updated 2 years ago
- [TIP 2022] End-to-end Temporal Action Detection with Transformer☆166Feb 19, 2023Updated 3 years ago
- [ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"☆45Jan 17, 2022Updated 4 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆114Aug 3, 2023Updated 2 years ago
- [CVPR2022] MS-TCT☆54Oct 8, 2022Updated 3 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆87Feb 19, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆32Apr 3, 2026Updated last month
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆10Apr 19, 2024Updated 2 years ago
- Caffe: a fast open framework for deep learning.☆106Feb 27, 2018Updated 8 years ago
- [NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection☆140Jul 25, 2024Updated last year
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆5,036Mar 18, 2026Updated 2 months ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- Code base for zero-shot action localization through spatial-aware object embeddings☆25Nov 3, 2017Updated 8 years ago
- [AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation☆17Nov 13, 2022Updated 3 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆609Dec 6, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆791Oct 8, 2024Updated last year
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)☆253Oct 19, 2019Updated 6 years ago
- Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"☆57Oct 10, 2021Updated 4 years ago
- ☆195Oct 22, 2022Updated 3 years ago
- ☆14Sep 19, 2016Updated 9 years ago
- ☆12Oct 21, 2019Updated 6 years ago