PyTorch implementation of X3D models with Multigrid training.
☆101Oct 10, 2021Updated 4 years ago
Alternatives and similar repositories for X3D-Multigrid
Users that are interested in X3D-Multigrid are comparing it to the libraries listed below.
- Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"☆57Oct 10, 2021Updated 4 years ago
- [CVPR 2020] X3D: Expanding Architectures for Efficient Video Recognition☆23Nov 3, 2020Updated 5 years ago
- Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"☆11Jan 6, 2021Updated 5 years ago
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated last month
- Code for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Mu…☆68Aug 31, 2022Updated 3 years ago
- ☆14Jun 25, 2022Updated 3 years ago
- [Codes of paper]: Busy-Quiet Video Disentangling for Video Classification☆14Jan 17, 2022Updated 4 years ago
- This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.☆257Oct 8, 2021Updated 4 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,354Mar 16, 2026Updated last month
- ☆11Nov 5, 2024Updated last year
- [ICLR2021] official implementation of CT-Net☆37Dec 29, 2021Updated 4 years ago
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020☆139Oct 14, 2021Updated 4 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆36Jun 17, 2024Updated last year
- PyTorch 3D video classification models pre-trained on 65 million Instagram videos☆265Dec 7, 2019Updated 6 years ago
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆43Feb 10, 2026Updated 2 months ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- Implementations of some few-shot action recognition methods.☆43Jun 7, 2021Updated 4 years ago
- ☆87Mar 4, 2024Updated 2 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆130Aug 31, 2021Updated 4 years ago
- ☆15Apr 18, 2022Updated 4 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆55Mar 30, 2023Updated 3 years ago
- TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition☆17Mar 26, 2024Updated 2 years ago
- ☆18Dec 17, 2022Updated 3 years ago
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆5,004Mar 18, 2026Updated last month
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 3 years ago
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- Attribution (or visual explanation) methods for understanding video classification networks. Demo codes for WACV2021 paper: Towards Visua…☆21Oct 3, 2023Updated 2 years ago
- [ECCV2020] Learn optimal resolution and skipping mechanism for efficient video understanding☆63Aug 17, 2020Updated 5 years ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- This is an official implementation for "Video Swin Transformers".☆1,651Mar 8, 2023Updated 3 years ago
- Fast CUDA implementation of (differentiable) otam for PyTorch using Numba☆16Jun 21, 2021Updated 4 years ago
- MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;☆291May 26, 2022Updated 3 years ago
- Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…☆14Dec 9, 2021Updated 4 years ago
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 3 years ago
- ☆20Nov 29, 2021Updated 4 years ago
- ICCV 19 Grouped Spatial-Temporal Aggregation for Efficient Action Recognition☆43Oct 14, 2019Updated 6 years ago
- An implementation of the X3D video recognition architecture in TensorFlow/Keras☆16May 17, 2021Updated 4 years ago