"Object-Region Video Transformers”, Herzig et al., CVPR 2022
☆50Jul 6, 2022Updated 3 years ago
Alternatives and similar repositories for ORViT
Users that are interested in ORViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Object-Region Video Transformers☆24Mar 24, 2022Updated 4 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆27Apr 3, 2022Updated 4 years ago
- [ACM MM 2021] A causal perspective for compositional action recognition, providing a counterfactual debiasing inference implementation to…☆20May 5, 2022Updated 4 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- AFNet(NeurIPS 2022)☆20Nov 24, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆23May 10, 2022Updated 4 years ago
- ☆13Nov 29, 2021Updated 4 years ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆22Jul 20, 2024Updated last year
- [ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".☆12Dec 23, 2023Updated 2 years ago
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- Official TensorFlow code for the paper "DeepWay: a Deep Learning Waypoint Estimator for Global Path Generation".☆11Jun 24, 2022Updated 3 years ago
- Video-Language Alignment via Spatio–Temporal Graph Transformer; ArXiv: https://arxiv.org/abs/2407.11677☆15Jul 24, 2024Updated last year
- ☆12Aug 5, 2022Updated 3 years ago
- Pytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018☆170Sep 11, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 3 years ago
- Official project of DiverseSampling (ACMMM2022 Paper)☆16Feb 25, 2023Updated 3 years ago
- 🚴♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)☆35Jul 2, 2025Updated 10 months ago
- CapsNet implementation in a minimal manner☆11Nov 17, 2017Updated 8 years ago
- Is Depth Really Necessary for Salient Object Detection? ACM MM 2020☆22May 30, 2024Updated last year
- Distributed Training of Bayesian Neural Networks at Scale☆11May 26, 2020Updated 5 years ago
- Implementation of 3D attention mechanisms based on https://github.com/LeftAttention/Attention-Codebase. Thanks to LeftAttetnion for shari…☆12Feb 22, 2022Updated 4 years ago
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Feb 10, 2026Updated 3 months ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A simple tkinter GUI for illustrating DFS and BFS.☆12Jun 26, 2020Updated 5 years ago
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- A tookbox for evaluating salient object detection algorithms☆21Jan 20, 2014Updated 12 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆16Apr 22, 2019Updated 7 years ago
- A Rideshare Simulation built in C++, using OpenStreetMap data☆14Oct 24, 2021Updated 4 years ago
- Code for the paper "Generalizing Hand Segmentation in Egocentric Videos with Uncertainty-Guided Model Adaptation"☆36Aug 28, 2020Updated 5 years ago
- A Probabilistic Programming Language in 70 lines of Python. Code for the blog post https://mrandri19.github.io/2022/01/12/a-PPL-in-70-lin…☆19Feb 10, 2022Updated 4 years ago
- ☆27Oct 11, 2024Updated last year
- ☆109Dec 23, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"☆12Jul 8, 2022Updated 3 years ago
- Multi-head Recurrent Layer Attention for Vision Network☆22Mar 2, 2023Updated 3 years ago
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆72Jan 9, 2025Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,737Dec 8, 2023Updated 2 years ago
- ☆14Mar 31, 2022Updated 4 years ago
- LinkedIn Web Scraper☆10Mar 3, 2021Updated 5 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago