Chuhanxx/helping_hand_for_egocentric_videos

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Chuhanxx/helping_hand_for_egocentric_videos)

Chuhanxx / helping_hand_for_egocentric_videos

Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'

☆33

Alternatives and similar repositories for helping_hand_for_egocentric_videos

Users that are interested in helping_hand_for_egocentric_videos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ninatu / in_style
View on GitHub
Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023
☆11Oct 5, 2023Updated 2 years ago
ninatu / howtocaption
View on GitHub
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆59Aug 19, 2025Updated 11 months ago
epic-kitchens / VISOR-VOS
View on GitHub
☆13Nov 2, 2023Updated 2 years ago
TRI-ML / VOST
View on GitHub
Code for the VOST dataset
☆26Oct 1, 2023Updated 2 years ago
srijandas07 / clip_baseline_LTA_Ego4d
View on GitHub
Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)
☆15Jul 4, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
epic-kitchens / VISOR-HOS
View on GitHub
Code for recreating the HoS benchmark of VISOR
☆24Jul 2, 2023Updated 3 years ago
soCzech / MultiTaskObjectStates
View on GitHub
Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI
☆11Mar 3, 2024Updated 2 years ago
florianHofherr / PhysParamInference
View on GitHub
☆19Jan 30, 2023Updated 3 years ago
dominickrei / PoseAwareVT
View on GitHub
Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Aug 2, 2024Updated last year
Charlotte-CharMLab / Fibottention
View on GitHub
Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"
☆17Oct 6, 2025Updated 9 months ago
yoxu515 / VIPOSeg-Benchmark
View on GitHub
The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".
☆12Oct 17, 2023Updated 2 years ago
fuqichen1998 / SequentialVotingDet
View on GitHub
[CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection
☆10Jun 19, 2022Updated 4 years ago
Sid2697 / HOI-Ref
View on GitHub
Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
☆30Apr 16, 2024Updated 2 years ago
jalayrac / object-states-action
View on GitHub
Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017
☆14Aug 7, 2018Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
sangmin-git / MMSI
View on GitHub
Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
☆19Jun 23, 2024Updated 2 years ago
ADL-X / LLAVIDAL
View on GitHub
This is the offical repository of LLAVIDAL
☆25Oct 4, 2025Updated 9 months ago
IIT-PAVIS / Positional_Diffusion
View on GitHub
Code for "Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models"
☆19Mar 21, 2023Updated 3 years ago
shuheikurita / RefEgo
View on GitHub
☆13Jul 20, 2024Updated 2 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
zhaoyue-zephyrus / AVION
View on GitHub
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
☆138Aug 23, 2025Updated 11 months ago
zhaoyue-zephyrus / TeSTra
View on GitHub
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
☆119Aug 23, 2025Updated 11 months ago
jylins / hourllava
View on GitHub
[NeurIPS 2025 Spotlight] Unleashing Hour-Scale Video Training for Long Video-Language Understanding
☆19Jun 24, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Fsoft-AIC / Z-GMOT
View on GitHub
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
☆12May 19, 2026Updated 2 months ago
showlab / EgoVLP
View on GitHub
[NeurIPS 2022] Egocentric Video-Language Pretraining
☆261May 9, 2024Updated 2 years ago
ut-vision / ActionVOS
View on GitHub
[ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation
☆32Dec 4, 2024Updated last year
shvdiwnkozbw / SSL-UVOS
View on GitHub
[ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
☆34Mar 7, 2025Updated last year
dominickrei / Limited-data-vits
View on GitHub
[WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"
☆25Aug 16, 2024Updated last year
minghangz / OnVTG
View on GitHub
Online video temporal grounding
☆16Oct 20, 2025Updated 9 months ago
ut-vision / S2DHand
View on GitHub
☆34Jul 17, 2024Updated 2 years ago
yuggiehk / CaRe-Ego
View on GitHub
The official implement of Paper - CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation
☆19Aug 12, 2025Updated 11 months ago
xingaoli / DP-HOI
View on GitHub
Disentangled Pre-training for Human-Object Interaction Detection
☆28Sep 17, 2025Updated 10 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
gorkaydemir / SOLV
View on GitHub
[NeurIPS 2023] Self-supervised Object-Centric Learning for Videos
☆32Nov 28, 2024Updated last year
yoxu515 / MITS
View on GitHub
☆21Jul 25, 2024Updated 2 years ago
thearkaprava / MS-Temba
View on GitHub
[CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'
☆48Jun 22, 2026Updated last month
owenzlz / EgoHOS
View on GitHub
Fine-Grained Egocentric Hand-Object Segmentation, ECCV 2022
☆144Feb 26, 2024Updated 2 years ago
tomchen-ctj / OST
View on GitHub
【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
☆39Apr 27, 2024Updated 2 years ago
jinxiang-liu / anno-free-AVS
View on GitHub
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
☆38Oct 11, 2024Updated last year
lxtGH / TemporalPyramidRouting
View on GitHub
Temporal Pyramid Routing For Video Instance Segmentation-T-PAMI-2022
☆25Jul 6, 2023Updated 3 years ago