[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆63May 18, 2023Updated 2 years ago
Alternatives and similar repositories for STMixer
Users that are interested in STMixer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆39Sep 27, 2023Updated 2 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 8 months ago
- [ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions☆134Aug 4, 2023Updated 2 years ago
- [ICCV 2023] Deep Equilibrium Object Detection☆27Jun 18, 2025Updated 9 months ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆69Feb 3, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Jul 7, 2025Updated 9 months ago
- [TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition☆13May 15, 2023Updated 2 years ago
- [ECCV 2020] Actions as Moving Points☆271Dec 19, 2020Updated 5 years ago
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆71Jan 9, 2025Updated last year
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model☆24Nov 1, 2024Updated last year
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆26Oct 16, 2023Updated 2 years ago
- [CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector☆237Aug 17, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Spatio-Temporal Action Localization System☆426May 21, 2022Updated 3 years ago
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- Context-aware RCNN: a Baseline for Action Detection in Videos☆51Oct 13, 2020Updated 5 years ago
- [CVPR 2022] Task-specific Inconsistency Alignment for Domain Adaptive Object Detection☆40Jul 20, 2022Updated 3 years ago
- [ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes☆202Jul 24, 2023Updated 2 years ago
- [NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking☆208Apr 20, 2024Updated last year
- Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions☆134Jun 7, 2022Updated 3 years ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆47Nov 24, 2023Updated 2 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- [ICCV2023] MixSort: The Customized Tracker in SportsMOT☆93Aug 21, 2023Updated 2 years ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆35Dec 23, 2024Updated last year
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆30Feb 4, 2026Updated 2 months ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆53Mar 5, 2024Updated 2 years ago
- [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay☆43Jun 26, 2025Updated 9 months ago
- Hopenet: deep head pose estimator on ncnn☆10Jun 18, 2020Updated 5 years ago
- The second generation of YOWO action detector.☆283May 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆50Jul 6, 2022Updated 3 years ago
- ☆20Jan 29, 2023Updated 3 years ago
- [ICCV 2021] Self Supervision to Distillation for Long-Tailed Visual Recognition☆21Feb 9, 2022Updated 4 years ago
- download AVA dataset☆22Sep 5, 2023Updated 2 years ago
- [CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception☆97Jul 27, 2024Updated last year
- ☆11Jul 30, 2025Updated 8 months ago
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)☆253Oct 19, 2019Updated 6 years ago