"Object-Region Video Transformers”, Herzig et al., CVPR 2022
☆50Jul 6, 2022Updated 3 years ago
Alternatives and similar repositories for ORViT
Users that are interested in ORViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Object-Region Video Transformers☆24Mar 24, 2022Updated 4 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆27Apr 3, 2022Updated 4 years ago
- [ACM MM 2021] A causal perspective for compositional action recognition, providing a counterfactual debiasing inference implementation to…☆20May 5, 2022Updated 3 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning☆41Jul 7, 2021Updated 4 years ago
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆17Jun 17, 2023Updated 2 years ago
- [NeurIPS 2023] Learning Motion Refinement for Unsupervised Face Animation☆40Dec 3, 2023Updated 2 years ago
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆23May 10, 2022Updated 3 years ago
- ☆13Nov 29, 2021Updated 4 years ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆21Jul 20, 2024Updated last year
- A zero-shot captcha solver.☆16Dec 22, 2023Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆31Sep 5, 2023Updated 2 years ago
- Materials for PyCon 2016 in Portland, Oregon☆10Aug 30, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- Slide and notebook used for my talk on vaex at the Pandas summit 2019 @ Lodnon☆11Jun 13, 2019Updated 6 years ago
- Video-Language Alignment via Spatio–Temporal Graph Transformer; ArXiv: https://arxiv.org/abs/2407.11677☆14Jul 24, 2024Updated last year
- ☆12Aug 5, 2022Updated 3 years ago
- Pytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018☆170Sep 11, 2018Updated 7 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆20Aug 5, 2025Updated 8 months ago
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"☆12Mar 26, 2026Updated last month
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 2 years ago
- Official project of DiverseSampling (ACMMM2022 Paper)☆16Feb 25, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🚴♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)☆35Jul 2, 2025Updated 10 months ago
- Implementation of the paper Video Action Transformer Network☆138Apr 5, 2021Updated 5 years ago
- Is Depth Really Necessary for Salient Object Detection? ACM MM 2020☆22May 30, 2024Updated last year
- This repository contains the codebase mentioned and used in trains' blogs☆11Jul 25, 2025Updated 9 months ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆28Jan 21, 2026Updated 3 months ago
- Distributed Training of Bayesian Neural Networks at Scale☆11May 26, 2020Updated 5 years ago
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Feb 10, 2026Updated 2 months ago
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Dec 7, 2021Updated 4 years ago
- DL4CV book☆10Sep 18, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A UI automation engine☆11Mar 20, 2026Updated last month
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆32Oct 18, 2024Updated last year
- A simple tkinter GUI for illustrating DFS and BFS.☆12Jun 26, 2020Updated 5 years ago
- A tookbox for evaluating salient object detection algorithms☆21Jan 20, 2014Updated 12 years ago
- A simple way to transport dynamic data over ROS comms☆17Updated this week
- A Rideshare Simulation built in C++, using OpenStreetMap data☆14Oct 24, 2021Updated 4 years ago